SPIRAL MINIMAL SURFACES* 
BY 
J. K. WHITTEMORE 
1. Introduction. The equations of any minimal surface S may be writtent 


(1) F(u)du—5 (1 + 0°) 6(v)dv = Us + Ve 


= f uk (u)du + [ = U;+ V3. 


If the surface is real F and ® are conjugate functions, and for a real point with 
a real tangent plane u and v have conjugate values.{ The direction cosines 
of the normal to S are 

Ute , t(v—4) uv — 1 


(2) X= ~ ~ 


The linear element ds is given by 
ds? = (uv + 1)? F(u)®(v) dudv. 


The differential equations of the lines of curvature and of the asymptotic 
lines of S are respectively 


F(u)du?—@(v)d =0, F(u)dw? + de? = 0. 


The curves of any surface along each of which the tangent plane to the 
surface makes a constant angle with a fixed direction are called the Minding 
parallels of the surface with reference to that direction; the curves of the 
surface along each of which the normal is parallel to a plane containing the 
fixed direction are called the Minding meridians.§ We shall consider Minding 

* Read before the American Mathematical Society, December 27, 1916. 

+t Darboux, Théorie des surfaces, vol. 1, 2d ed., p. 340. These equations are due to En- 
neper (1864) and Weierstrass (1866). 

t Darboux, 1. p. 343. 

§ These curves were first studied by Minding, Journal fiir Mathematik, 
vol. 44 (1852), p. 66. Darboux, 1. c., p. 367. 
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parallels and meridians of minimal surfaces with reference to the Z axis. If 
in (1) we set u = re®** and v = re~**, where for real points of a real surface r 
is real and positive and ¢ is real, it is evident from (2) that the curves (r), 
that is r constant, are the Minding parallels and the curves (¢) the Minding 
meridians. In the study of spiral minimal surfaces, the subject of this paper, 
the curves (r) play a part analogous to that taken by the curves (¢@) in the 
discussion of minimal surfaces applicable to surfaces of revolution. 

If F and ®@ in (1) are replaced by e* F and e~“ ® respectively, where a 
is a real constant, we obtain the equations of a family of minimal surfaces 
called associate to S.* Corresponding points of associate minimal surfaces 
are points given by the same parameter values wu, ». Schwarz proved that 
the locus of points on a family of associate surfaces corresponding to given 
values of uw, v is an ellipse whose center is the origin. These ellipses we call 
Schwarz ellipses. We have found the equations of the locus L of the vertices 
of the Schwarz ellipses for a given minimal surface with equations (1) to bef 


z=U, y =U, Sut 


=U 44 


In (3) the first radical of each second member has the same determination; 
the second is the reciprocal and conjugate of the first for real points of a real 
surface S. The four vertices of each ellipse are given by the four determina- 
tions of the first radical. Evidently ZL consists generally of four nappes 
symmetrical in pairs with respect to the origin. 

A spiral surface is most simply defined as follows:{ A spiral surface is the 
locus of the different positions of any curve which is rotated about an axis 
and at the same time subjected to a homothetic transformation with respect 
to a point of the axis in such a way that the tangent to the locus described 
by any point of the curve makes a constant angle with the axis. The spiral 
surfaces include surfaces of revolution but we shall hereafter in speaking of a 
spiral surface suppose it not to be a surface of revolution. Evidently the 
locus of each point of the curve generating a spiral surface lies on a right cir- 

* Associate minimal surfaces were first systematically considered by H. A. Schwarz, 
Miscellen aus dem Gebiete der Minimalflichen, Journal fir Mathematik, vol. 
80 (1875), p. 286, see also Darboux, |. ¢., p. 379. 

tAmerican Journal of Mathematics, vol. 40 (1918), p. 87. 

t Eisenhart, Differential Geometry, p. 151. Spiral surfaces were first studied by Maurice 
Lévy in a paper published in Comptes Rendus, vol. 87 (1878), p. 788. He gave 
them as an example of surfaces such that EZ, F', G, the coefficients in the square of the linear 
element, are homogeneous functions of the parameters of any degree other than —2. Lévy 
named them “surfaces pseudo-moulures logarithmiques.”’ 


(3) 
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cular cone whose axis is the axis of the surface, cuts all elements of the cone at 
the same angle, and is projected on a plane perpendicular to the axis in a 
logarithmic spiral. Such a curve we shall call simply a spiral.* The equa- 
tions of any spiral surface may be written 


(4) x = pe’ cos(w + hv), y = pe” sin(w + hv), 2 = fe", 


where h is constant, p, w, and ¢ are functions of u alone, and the curves (1) 
are the spirals. Darboux has provedf that the orthogonal trajectories of the 
spirals of a spiral surface whose generating curve is given may be found by 
quadratures, and the square of the linear element written in the form 


ds? = U2 (du? + de), 


where U is a function of u alone. If U is given, h an arbitrary constant, and 
¢, p, w determined from 


(6) +5") = U*, 8, 


the surface (4) has the linear element in the form given, so that the solution 
of (5) determines a three-parameter family of spiral surfaces applicable to a 
given spiral surface. 

Lie proved§ that all real minimal surfaces applicable to spiral surfaces are 
given by writing in (1) 


(6) F (u) = 2tnt ( v) 


where ¢ and ¢ are conjugate constants and m and n are real constants. The 
number — 2 is introduced in the exponents only to simplify future formula- 
tions. No essential restriction is imposed by taking the modulus of c as unity, 
so that we shall suppose c = e* where a is real. If in (6) nm = 0 equations 
(1) and (6) give all minimal surfaces applicable to surfaces of revolution.|| 
That these surfaces are applicable to surfaces of revolution if n is zero, and 
when n is not zero to spiral surfaces, appears from the linear element of (1) 
and (6) in terms of r, @ 


ds? = (7? 4.1)? (dr? + dg”). 


* These curves have been named “‘cylindro-conical helices.”” See Encyclopddie der mathe- 
matischen Wissenschaften, Bd. III, 3, Heft 2, 3, p. 252. 

L.c., pp. 148-150. 

t Proved by Maurice Lévy, as stated by Darboux (1. c., p. 149). Of the three constants one 
is additive to w and has no effect on the form of the surface, so that there are generally given 
by the integration of (5) a two parameter family of different surfaces. 

§Mathematische Annalen, vol. 15 (1879), p. 503. Darboux, |. c., p. 396. 

|| First proved by Schwarz, Miscellen, p. 296. 
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Evidently the Minding parallels (r) of (1) correspond to the spirals of the 
spiral surface, and, when n is zero, to the parallels of the surface of revolution. 
If m = 0, n + O equations (1) and (6) give all minimal surfaces which are 
spiral surfaces.* 

A. Ribaucour proved{ that the minimal surfaces associate to a minimal 
surface applicable to a surface of revolution are congruent to the given minimal 
surface except for m = 0, which value gives the minimal helicoids. Darboux 
states without proof that the minimal surfaces associate to a minimal surface 
applicable to a spiral surface are similar to the given surface. In the following 
section we prove these theorems, noting that exception must be made to 
Darboux’s statement in the case of spiral minimal surfaces, and proving also 
a theorem, which we believe has not previously been stated, concerning the 
similarity to itself of the general surface given by (1) and (6). The remainder 
of this paper deals entirely with spiral minimal surfaces: we consider families 
of minimal surfaces associate to a surface of this kind, their Schwarz ellipses 
and the locus L of the vertices of these ellipses, the envelope of the family of 
associate surfaces, and the evolute surface; it is proved that on every spiral 
minimal surface there are an infinite number of plane spiral lines of curvature, 
the intersections of the surface with the plane part of L, an infinite number of 
spiral asymptotic lines, the intersections of the surface with the right conoids 
of L, and a single spiral geodesic; we show that on one member of a family 
of associate spiral minimal surfaces all the plane spiral lines of curvature 
except one are double curves; we discover certain symmetries of these surfaces, 
and finally obtain two characteristic properties of spiral minimal surfaces 
connected with the locus LZ and with the Minding parallels. 

2. Darboux’s similarity theorem. The first two of equations (1) may be 
combined and written 


r+yi= fe + &(0)de. 
For the minimal surfaces applicable to spiral surfaces we substitute from (6) 
F(u) =o yr tt, (n +0). 
When this surface is rotated about the Z axis through the angle 6, 


=, 


where x’, y’, 2’ are the new coordinates of z, y, z. Sufficient conditions 
that this surface in the new position be similar to that surface for which 


* First proved by Darboux, I. ¢., p. 359. 

t “Etude sur les élassoides ou surfaces & courbure moyenne nulle.””’ Memoires Cour- 
onnés de PAcadémie Royale de Belgique, vol. 44 (1882), chap. XX. 
Ribaucour disregards the cases m = 0, +1; Darboux proves the theorem for all cases except 
m = 0 (l. c., pp. 395, 396). 
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a = 0 in the original position with the origin as center, C as the ratio of 
similarity, and all constants of integration chosen as zero, are that wu’ and v’ 
may be determined so that, with constant real values for 6 and C, 


u” F(u’) du’ + ®(v') dv’) = C(— udu + dv), 
F(u') du’ + v' @(0') dv’ = C( wr du + dv). 


These equations are satisfied by constant real values of 6 and C by taking 
u’ and v’ as conjugate functions of u and » alone respectively if the same is 
true of the following four equations: 


dy! = Cum du, dy! = dy, 
et du’ = du, dy! = dy. 
From the last group of equations 
u’ = ue~**, vo’ = ve**, 
Setting these values of wu’ and v’ in the four equations, we have 


so that we must have a — mB = 0. If mis not zero the equations are satisfied 
by the constant real values, 


B=", C=em, 
™m 


If m is zero the proof fails. Equations (7) express the two theorems: (I) 
Minimal surfaces associate to a minimal surface applicable to a surface of 
revolution (m = 0) are congruent to that surface and may be made to coincide 
with it by a rotation about the Z axis, except in the case of the minimal 
helicoids (m = 0). That the associate minimal helicoids are not similar is 
well known.* (II) Minimal surfaces associate to a minimal surface applicable 
to a spiral surface (n + 0) are similar and are brought into similar position 
by a rotation about the Z axis, except in the case of the spiral minimal surfaces 
(m =0). That the associates of a spiral minimal surface are not similar 
to that surface will appear in § 6 of this paper. 

If we observe that the associate to any minimal surface given by a = 2k7, 
where k is any integer, coincides with that surface, and that the associate 
a = (2k+1)z7 is symmetrical with respect to the origin to that surface, 
we have in (7) the proof of a theorem due to Ribaucour: A minimal suiface 
applicable to a surface of revolution, not a minimal helicoid, is brought into 
coincidence with itself by rotation about the Z axis through the angle 2kr/m; 
it appears also, though not stated by Ribaucour, that such a surface is brought 


# See, for example, Scheffers, Theorie der Flachen, 2d ed., pp. 362, 363. 
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into coincidence with its symmetry with respect to the origin by rotation about 
the Z axis through the angle (2k ++1)2/m. Finally it appears from (7) that 
a minimal surface applicable to a spiral surface (m + 0, n + 0) is similar to 
itself and to its symmetry with respect to the origin in an infinite number of 
positions if m is irrational, in a finite number if m is rational. 
3. Equations and linear element of S,,. We denote by S,, the spiral 

minimal surface given by substituting in (1) 

(9) = 
Performing the integrations indicated, taking all constants of integration as 
zero, then writing 

u = re**, = re~**, n= —cotB, 


we have the equations of S,,, in terms of the real parameters r, @ 


+? sin 


= —¢ "sin B[rcos(¢ —-B+a+nlogr) 


cos (@ — a — nlogr)], 


z= —2e"* tan@sin(a+nlogr). 


In these equations n is retained for the sake of abbreviation. If we let 


nlogr =u, —n(¢— 6) =2, h= 
and set 


= 2e-"* sin B sin (u + @) sinh = 


u 
— 2e-"* sin B cos (u + a) cosh = 
= — tanBsin(u+a), 
equations (8) take Darboux’s form (4) thus proving that S,, is actually a 


spiral surface and that the spirals are the Minding parallels (r). 
The intersection of S,, with the zy plane is given by 


a+nlogr = kr, 


where k is any integer, and each such value of k& gives a curve, as appears from 
the first two equations of (8), whose polar equation, codrdinates R, 0, is 
R = e "Ot? = gOt°"B | so that the surface cuts the plane in an infinite 


1 
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number of congruent logarithmic spirals which cut all radii vectores at the 


angle 6. 
The linear element of (8) is given by 


ds? = (r? + 1)? (dr + r dg’). 
Substituting the values u, v given after (8) 


ds? = U? (du? + dv’), U? = tan? B cosh? 


We may determine the spiral minimal surfaces S,,, applicable to a given spiral 
minimal surface Sy) which are given by the integration of (5) as follows. Let 
the subscript apply to all quantities relating to So; from U? = Uj follows 
n? = nj; the first of equations (5) gives 


= tan? B = U2? — U2 


In this equation we may replace Uy) by U, and are led to the condition 
h? n? = h? 2 = 1. The solutions of (5) with h = — 1/mp substituted in (4) 
give equations (8) where n is replaced by mo, @ is the constant introduced by 
the integration of the equation for ¢, and the constant additive to w, which 
affects only the position of the surface, is taken as zero. The integration of 
(5) with h = 1/no leads to the surfaces S_,,,, surfaces not associate to So 
but symmetrical with respect to the X axis to the associate surfaces, as will 
appear in §9. Spiral surfaces, not minimal, applicable to So would be given 
by the substitution in (4) of the solutions of (5) for values of h not equal to 
+ 1/m, but it appears to be impossible to integrate the equation for ¢ for 
such values. 
4. The locus L for S,,. From equations (1) for Sy» we find 


= (uo); 
that part of L given by (3) where the first radical is taken as (uv)~"*/?,, which 
we call I, , has the equations, obtained by substituting this value and writing 
as before u = re**, » = re~*', 
e7ne 1 
r 


z= 


sin B sin — 8), 


sin 8 cos (¢ — 8), 
z:=0. 


Points common to J, and S,, are given by 
(uv)-*4? = or a+nlogr = 2kr. 


The equations of L2, symmetrical with respect to the origin to I,, are 
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obtained by changing the signs of the second members of (9). Points common 
to I, and S,, are given by a+ nlogr = (2k+1)za. The curves common 
to I,, Iz, and S,, are the logarithmic spirals in which the latter cuts the 
xy plane. 

Choosing the first radical in (3) as i (uv)~"*? we obtain for L3, part of the 
locus L, the equations 


sin 8B cos (@— 8B), y sin 8 sin (@— 8), 


z= — 2e"* tan8B. 


(10) 


Points common to JL; and S,, are given by 
=e" or at+niogr = 2kr+z. 


The equations of L,, symmetrical to L;, are found by reversing the signs of 
the second members of (10). Points common to J, and S,, are given by 
a+nlogr = (2k+1)2+ 7/2. Equations (9) and (10) may be obtained 
by substituting the appropriate values of r in (8) and regarding a@ as variable. 

The surface J; is a spiral surface for (10) may be put in the form (4), and 
its spirals are the curves (r). From (10) it appears that every curve (¢) 
of Lz is a straight line intersecting the Z axis and parallel to the zy plane, so 
that Lz is a spiral right conoid. 

The distances b of r, @ of L, from the origin and a of r, @ of L; from the 
origin are given respectively by 


244\2. 2_41\2. 
) sin? 6, a? = ¢ : ) sin? 6 + 4 tant |, 


and for all r, @ we have a? > 6b’, so that a and b are respectively the semi- 
major and minor axes of Schwarz’s ellipse r, ¢; then the zy plane (I,, Iz) 
is the locus of the extremities of the minor axes, and the two conoids (L3, L,) 
form the locus of the ends of the major axes. Since a? — b? is independent of 
r all ellipses corresponding to a constant ¢ have the same focal distance; 
the eccentricity is independent of ¢ so that all ellipses corresponding to points 
of a spiral (r) have the same eccentricity. The maximum eccentricity is 
sin 8 and is given by r = 1; all ellipses, r = 1, have their major axes on the 
Z axis and touch Sy» at the extremities of their minor axes. When r approaches 
zero or becomes infinite the eccentricity approaches zero, corresponding points 
of S,,, receding indefinitely from the origin. 

5. Surfaces related to S,,. The envelope of a family of minimal surfaces 
associate to (1) consists of two surfaces symmetrical with respect to the origin 
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given by* 

x=U,H+V,H", y= U.2H+V.,H", z2=U;H+V;H", 
where 

+)U, +i(v—u)U2+ (w-1)U; 


For E, and E,, the two parts of the envelope of a family of associate spiral 
minimal surfaces, corresponding to the upper and lower signs respectively in H , 


H=+ 


2 


tan B 

The equations of EF, are also given by writing & — n log r in place of a in (8). 
That EF, is a spiral surface, whose spirals are the curves (r), may be shown 
by reducing its equations to the form (4); the expressions for p cos w, p sinw, 
and ¢ for E, are obtained by replacing a by £ — u in the equations following 
(8). Points common to S,, and F,, S,, and EF, are given respectively by 


H = + tan 


E=a+nlogr+ 2kr, t=a+nlogr+ (2k+1)rz. 


Each of these equations has just one positive root r for every integral value 
of k, so that every surface S,, is tangent to each surface of the envelope 
along an infinite number of spirals. The solution of the first equation for 
a=k=0Oisr =1,s0 that Sy is tangent to E, along this curve, which will 
appear as a curve of particular interest. 

The coérdinates of the points of the two nappes of the evolute surface of S,,, 
corresponding to r, ¢ of the latter are 


9 


2 1 : 


‘ | 
>, 
2r? 


where x, y, 2 are the codrdinates of r, ¢ on S,,, given by (8), X, Y, Z the 
direction cosines of the normal, and R the absolute value of either principal 
radius of curvature. The only interest in this evolute surface consists in the 
fact that it is a spiral surface, whose spirals are the curves (r), as may be 
seen by reducing its equations to the form (4). 

In connection with the spiral surfaces related to S,, the following con- 
siderations are of interest: the right circular cone whose axis is the Z axis 


*My paper, American Journal of Mathematics, l.c. 
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and whose elements make the angle y with this axis is 
=2 tan’ y. 
The spiral which cuts the elements of this cone under the angle } is 


e” Sin y cota sin cos (e + v) y sin y cota sin sin (ce + v) 


v sin y cota 


z=e cos Y. 


Equations (4) of any spiral surface may be written, setting » = — n@ and 
h=-—1/n, 


x =e" pceos(wt+¢), y=e"* psin(w+¢), 


The curve (1) of (4’) lies on the cone 2? + 7? = p? 2?/f?, so that tan y = p/f, 
which gives, for r = 1, tany = cosB cota. We find 


tan = siny tan 8, 


so that \ is determined by n = — cot Band y. Since for fixed n (4’) includes 
the surfaces S,,,, L3, £,, and the evolute surface of S,,,, it follows that every 
right circular cone whose axis is the Z axis cuts all these surfaces in congruent 
spirals. 

6. Special spirals on S,,. The differential equation of the lines of curvature 
of (1) becomes for S,,, 


2tnt dy? — = 0. 
This equation integrated gives 
(uv)? + 1 = 


The only spirals (r) among the lines of curvature are given by c = 0. For 
them we have, belonging respectively to the two families of lines of curvature, 


a+nlogr = 2kr, a+nlogr = (2k+1)7z. 


These are the curves of intersection of S,, with the zy plane, and are log- 
arithmic spirals; they are moreover the curves common to S,, and Jy, S,, 
and I» respectively. For a = k = 0 the first equation gives r = 1, which 
is therefore a line of curvature of Syo. 

The differential equation of the asymptotic lines of S,,, is 


di? + dr? = 0, 
giving the integral 
(uv)? + = cy, 


The only spirals among the asymptotic lines are given by c = 0; for these, 
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belonging respectively to the two families of asymptotic lines, 
a+nlogr = 2kr+35, a+nlogr = (2k+1)r+35. 


These are the curves common to S,, and L3, S,, and Ly respectively. The 
first of these equations gives r= 1 for a—7/2=k=0. This line on 
S,,i2 is the Z axis. Since on every surface the Minding parallels with refer- 
ence to the Z axis and the curves of steepest ascent form a conjugate system,* 
the asymptotic spirals are curves of steepest ascent and are the only such 
spirals. 

The properties of the Minding parallels (r) of S,, are analogous to prop- 
erties of the Minding meridians (@) of minimal surfaces applicable to surfaces 
of revolution.f We note further that S,, has no line of curvature or asymp- 
totic line (¢) while minimal surfaces applicable to surfaces of revolution, 
with the exception of the catenoid and its adjoint surface, the right helicoid, 
have no line of curvature or asymptotic line (r). 

From the linear element of S,,, given in § 3, it appearst that the only spiral 
geodesic of this surface is r = 1. This spiral being plane for a = 0, Sno has 
symmetry with respect to its plane, the zy plane; since, for a = 7/2, this 
spiral is the Z axis S,,;, has symmetry with respect to this axis. On any 
,a2 there will correspond to the spiral geodesic, r = 1, 
a spiral geodesic on a cone whose semi-vertical angle y is given by 


surface similar to S 


tan y = cos Bcota. 


On the associate surface S,,, the only spiral geodesic lies on a cone of semi- 
vertical angle y’ given by tan y’ = cos 8 cota’. Then S,, and S,,, are not 
similar if a’ — a +kr. There is no geodesic (¢) of S,,. 

7. Double spirals of S,9. Writing a = 0 in (8) the equations of Sy» are 


sin » sin (@ — B+ nlogr) + nlogr) |, 


= sin re0s (¢ —B+nlogr) + — 8B nlogr) |, 
z= —2e"¢ tan B sin (n logr). 


If in these equations r is replaced by 1/r, x and y are unchanged and the 

sign of z is reversed. If we consider those values of r satisfying the equation, 

n log r = kw, we have the logarithmic spiral lines of curvature in which Syo 
*My paper, Annals of Mathematics, ser. 2, vol. 19, p. 4. 


{t My paper, Annals of Mathematics, le. 
t Eisenhart, Differential Geometry, pp. 134, 266, 267. 
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intersects the zy plane; excepting the value r = 1, it appears that each of 
these spirals is given by two different reciprocal values of r. The direction 
cosines of the normal to the surface, given in § 1, are in terms of r, ¢ 


2r cos , 2rsing 


Changing r to 1/r, X and Y are unaltered and the sign of Z is reversed. It 
follows that along each spiral line of curvature of Sy», except r = 1, two 
branches of the surface intersect; moreover they intersect at a constant angle 
along each curve, the angle depending on r alone and approaching the limit 7 
as r increases indefinitely from unity. 

8. Further properties of spirals on S,,. In the first paper* published deal- 
ing with minimal surfaces applicable to surfaces of revolution E. Bour showed 
that every Minding meridian (¢) of the surface cuts at the same angle all 
members of either family of lines of curvature and all level curves (z). The 
Minding parallels (r) have the same property on S,,, as may be shown 
directly; we find, for example, that the angle of (r) and (z) isa+nlogr, 
and is therefore constant with r. 

The equation of the tangent plane to S,, at r, ¢ is 


xcos¢+ysing P(r)e**, 


9 


P(r) = — sin? see (a logr — &), 


where & is the angle introduced in §5. The helicoidal developable tangent 
to S,, along (r) therefore intersects the xy plane in the envelope of the lines, 
xcos¢@+ysin¢@ = P(r)e~"*, which is a logarithmic spiral congruent to 
the plane lines of curvature (r). It may also be shown from the equation 
of the tangent plane that the planes tangent to the surface along (¢) envelop 
a cylinder whose elements are parallel to the zy plane, and that the cylinders 
corresponding to different values of ¢ are similar and are brought into similar 
position by rotation about the Z-axis through the angle ¢. Bour stated in 
the paper cited that in the case of a minimal surface applicable to a surface of 
revolution the developables tangent to the surface along a Minding meridian 
(@) are cylinders whose elements are parallel to the zy plane; we have proved 
that this is a property of every non-developable surface.t Ribaucour proved 
in the mémoire cited that in the case of minimal surfaces applicable to surfaces 


* “Théorie de la déformation des surfaces,’ Journal de l’école polytech- 
nique, cahier 39 (1862). 


tAnnals of Mathematics, l.c. 
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of revolution these cylinders are similar. Stiibler* determined all minimal 
surfaces which are envelopes of similar cylinders which may be brought into 
similar position by rotation about the Z-axis, finding that such surfaces are 
given by three different forms of F(u) in (1), each depending on several 
parameters, these surfaces including those applicable to surfaces of revolution 
and spiral minimal surfaces. 

9. Symmetries of S,,. From the equations (8) of S,, it appears that if 
a and r are replaced by — a and 1/r respectively the values of x and y are 
unchanged and the sign of z is reversed; it follows that S,, and S,_, have 
symmetry with respect to the zy plane, symmetrical points being given by 
r,@and 1/r,¢. If in (8) nis changed to — n, and consequently 8 to — B, 
if then r, @ are replaced by 1/r, — @ respectively, x is unchanged, and the 
signs of both y and z are reversed; then S,, and S_,, are symmetrical with 
respect to the Y axis. It follows that S,, and S_,_, are symmetrical with 
respect to the xz plane. If the isothermal parameters logr, ¢ are used as 
coérdinates symmetrical points in all three cases are given by changing the 
signs of one or of both coérdinates. 

10. Two properties characteristic of S,,. We prove a characteristic prop- 
erty of spiral minimal surfaces together with a similar characteristic property 
of minimal surfaces applicable to surfaces of revolution given by the following 
theorem, a converse of theorems given in §8. If every Minding parallel (r) 
of a real minimal surface is an isogonal trajectory of either family of lines of 
curvature or of the level curves of the surface the minimal surface is a spiral 
surface or a helicoid; if every Minding meridian (¢@) of a real minimal surface 
is an isogonal trajectory of either of the families named the minimal surface is 
applicable to a surface of revolution. 

To prove the first part of this theorem we observe that the condition that 
all curves (r) be isogonal trajectories of either of the families named may 
be expressed by the equation 


uw F(u) 
=f(ur), 


where F and ® are the conjugate functions in (1) and f is unknown. If this 
equation is differentiated first with respect to u and then with respect to v, 
and the two values of f’ equated, we have 


2F(u) +uF’(u) 2H + oP’ (0) | 
F(u) ®(v) 
where n is a real constant. These give 


F(u) = @(v) = cv, 


*Mathematische Annalen, vol. 75 (1914). 
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so that the minimal surface is a spiral surface, or, if n = 0, a helicoid. The 
condition that each curve (@) of (1) be an isogonal trajectory of any one of 


the families named is 
uw F(u) u 


which leads in a similar way to 
F(u) = cu, @(v) = 


where m is a real constant. The only real minimal surfaces such that both 
families of curves (r) and (¢) have the isogonal property are the helicoids. 
It is easily shown that it is only on the minimal helicoids that either of these 
families, (r) and (@), forms an isogonal system with any of the three other 
families named, for only in this case is the angle of intersection constant. 

It appeared in § 4 that for spiral minimal surfaces part of the locus L is a 
plane containing the center of the Schwarz ellipses;* we have elsewhere proved 
that the same thing is true for minimal surfaces applicable to surfaces of 
revolution if m is different from zero or plus or minus one. We now prove 
that this property is characteristic of these two classes of real minimal surfaces. 

Since the equations of any minimal surface in any position are given by (1) 
we may without restriction suppose the equations of a minimal surface for 
which part of L is a plane containing the center of the Schwarz ellipses to have 
the form (1) and this plane part of L to be the zy plane, z = 0. From (3) 
it follows that, for all u,v, 

z= 
NEU; 


for some determination of the first radical. This gives one of the identities 


U3 V3 Us V3 
where V>-Uj and v>-Vj are conjugate and ¢ is a constant real in the first 
case and pure imaginary in the second. We consider the equation 


U;/vLUi =e, 


and write 
VLU; U; = Xp, Us = up, Us = cep. 


From the last equations 
V+w+e=1, AN’ + wy’ = 0, 
U; = Ap’ + U; = up’ +n’ p, U; = cp’. 


*American Journal of Mathematics, le. 
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From the fact that uw + v,i(v — u), wv — 1 are proportional to the direction 
cosines of the normal to (1) it follows that for all u, v 


(u+vo)U, (w-1)U; =0, 
which may be replaced by the two equations 
u(U; —iU,) —U;=0, U;, +iU;, + uw; =0. 
The last two equations give ©U;’ = 0, from which, substituting the values 
given for U;, Uz, U3 
Substituting the values of U;, U;, U; in the two linear equations connecting 


these quantities, then replacing in these p’ by + pi Vr” +p”, and cancelling 


p, which cannot be identically zero unless (1) is a plane,* we find 

= t(A+ mi + cu), 
+ 


Multiplying the last two equations, 


(A+ in 41 


From this and \? + p? + c? = 1 we find 


A+ A-yw= - — —. 


Differentiating and multiplying the results, \” + wp” = (c? — 1)/u?, and 

p = Ku*"-*, Us = cp = 
If ec = 0 the surface (1) is the plane, z = 0; if ¢ = + 1 the surface is the plane 
z = constant; for all other values of ¢ the function F (wu) of (1) is given by 


F(u) = 7 Us = 


If ¢ is pure imaginary we have F(u) = Au”, where m is a real constant 
numerically greater than one; if ¢ is real and numerically less than one, F («) 
has the same form and m is real and numerically less than one. Such values 
of ¢ give all minimal surfaces applicable to surfaces of revolution, except that 


* My paper, American Journal of Mathematics, l.e. 
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the values m = 0, +1 are excluded, and distinguish two classes of these 
surfaces which have widely different properties.* Finally, if ¢ is real and 
, where n is a real constant, 


numerically greater than one, F (uw) = Aut" 
and the surface (1) is a spiral surface. 
SHEFFIELD ScIENTIFIC SCHOOL 
UNiversity, 1917 


* My paper, Annals of Mathematics, l.c. 


ON THE GROUP OF ISOMORPHISMS OF A CERTAIN EXTENSION 


OF AN ABELIAN GROUP* 


BY 
LOUIS C. MATHEWSON 


INTRODUCTION 


In 1908 Professor G. A. Miller showed that “if an abelian group H which 
involves operators whose orders exceed 2 is extended by means of an operator 
of order 2 which transforms each operator of H into its inverse, then the 
group of isomorphisms of this extended group is the holomorph of H.’’f 
The present paper discusses an elaboration of the idea embodied in Professor 
Miller’s thoerem, the successive developments taking an abelian group H 
each of more general type so that in toto it is proved that if G is formed by 
extending H which has operators of order > 2 by a certain operator from its 
group of isomorphisms which transforms every one of its operators into the 
same power of itself and which is commutative with no operator of odd order 
in it, then the group of isomorphisms of G is the holomorph of H, and is a 
complete group if H is of odd order. If H contains no operators of even order, 
the “certain operator” is any operator from the group of isomorphisms 
of H effecting the stated automorphism; if H contains no operator of odd 
order, the “ certain operator” must transform every operator of H into its 
inverse; if H contains operators of both even and odd orders (a) the order of 
the automorphism of the operators of odd order effected by the extending 
operator is to be divisible by 2” where H contains an operator of order 2” 
but none of order 2"*', or else (b) the extending operator transforms into its 
inverse every one of H’s operators whose order is a power of 2. Obviously 
it is not necessary that the extending operator be “ from its group of iso- 
morphisms,” but that it have certain properties possessed by this operator; 
thus, besides effecting the automorphism required, its first power commutative 
with all the operators of H must be the identity (from which follows that its 
first power appearing in H is the identity). 

The general method used in establishing each of the successive theorems is 
the same; two important steps in each proof are showing that H is character- 


‘ 


~ * Presented to the Society, September 5, 1917. 
t Miller, The Philosophical Magazine, vol. 231 (1908), p. 224. 
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istic in G and that the group of isomorphisms of G contains an invariant 
subgroup simply isomorphic with H. Throughout the paper use is frequently 
made of properties of rational integers, these interrelationships being but 
simple illustrations of the comradeship existing between group theory and 
number theory. The group of isomorphisms is represented by J and a rational 
prime (odd unless otherwise specified) by p, and the statement “?¢ transforms 
every operator of a group into the same power of itself, not the first power ” 


means every operator except the identity. 


THEORY 


1. TurorEM 1. If a group G is formed by extending a cyclic group H of 
order p™ (p an odd prime) by an operator from its group of isomorphisms which 
transforms every one of its operators into the nth power of itself where n = 1, 
mod p, then the I of G is the holomorph of H and is a complete group. 

Let s be a generator of H and let r be the order of the extending operator { 
(note that r divides ¢(p™), = p™"(p —1)*); then r is the exponent to 
which n appertains, mod p™{ Note 1]; and ¢ is from the central of the I of H.{ 

If s; is any operator of H , t's; t = 8}, and ts; effects the same automorphism 
of H as does t, while (*s;, 1 < a <r effects a different one. Moreover, ts; 
is of order r, for 


nr—l 


= s;"—' (since 8; 1 = gives 8; = ts? 
1 (since n” — 1 = 0, mod p”™ and n — 1 $= 0, mod pf). 


Furthermore, ts; cannot be of order < r, for if t is an operator from the I of a 
group H, no other operator effecting the same automorphism of H can be of 
lower order, because the order of an operator in the J of a group must be 
exactly the order of the automorphism it effects. From these properties of 
ts; it is evident that, as soon as H has been shown characteristic in G, in all 
the automorphisms of G ¢ corresponds only to itself or to itself multiplied 
by some operator of H. 

It will next be shown that the orders of the operators in the tail of @ either 
divide r or divide p™.§ Let ¢*s; be such an operator, 1 = a <r which is 
= ¢(p™). The cases will be considered separately. 

* Mathews, Theory of Numbers (1892), p. 18. 

- t Miller, these Transactions, vol. 1 (1900), p. 397. 

t Miller, Blichfeldt, and Dickson, Finite Groups (1916), § 24. 

© Cf. Gauss, Disquisitiones Arithmetice (1801), § 79. 

§ Cf. Miller, these Transactions, vol. 4 (1903), p. 156. 
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Case I. If n* = 1, mod p, the order of ¢ 8; divides r. 
nra—l nra—] 


nra—| 


= (since = gives 8; i* = whence = 
J J Jj g J j 
= 1, because n’ = 1, mod p”™, so that n™ — 1 is divisible by p™ 
while in this first case n* — 1 is not divisible by p. 


Case II, If n* = 1, mod p, the order of ¢ 8; divides p”. 
nP™_1 
= =s8;"-' =1. For suppose p?(b >0) is the high- 
est power of p dividing n* — 1; then n*=1, mod p?, and (n*)”"=1, 
mod p™*t® [Note 2]. Hence, 


m 
mod p™. 

Note 1. If p is an odd prime and n #1, mod p, and if n appertains to the exponent q, 
mod p™, then it appertains, mod p™*', to q or else to pq. 

Obviously the exponent to which n appertains, mod p™*, is not less than g. Let 
nt =l+kp™. Two cases will be made. 

If k =0, mod p, evidently n appertains to the exponent q, mod p™*!. 

If k = 0, mod p, let n appertain to exponent v, mod p™*'!, or = 1+ 

Subtracting the preceding equation gives n?(n**—1) = (lp—k)p™. Since neither 
nt nor (lp —k) is divisible by p, n*-¢ —1 is divisible by p™ but not by p™*, so that 
v —q =0, mod gq, or v is a multiple of g. On raising n? to the ath power there results 
na = (l+kp™)* =1+ akp™ + Np™* (where N is a positive integer). From the last 
member, evidently the first value a can take so that n*¢ shall be = 1, mod p™*? is the value p. 
Hence the proposition. 

Corotuary. If p is an odd prime and n #1, mod p, and if n appertains to the exponent 
q, mod p™, then it appertains, mod p™—, to q or else toq/p. 

For the even prime it is similarly demonstrable that if n = 1, mod 2, and n appertains 
to the exponent 2", mod 2™, then it appertains, mod 2™*', to 2” or else to 2¥*!. 

Corotiary. If n = 1, mod 2, and if n appertains to the exponent 2“, mod 2”, then it ap- 
pertains, mod to 2“ or else to 

Note 2. If d=1, mod p*, where p is an odd prime (e > 0), then d?™ =1, mod p**™, 
This is readily established by mathematical induction. It is evidently true for m = 0. 
Assume true for mth power of p; i. e., d@?” =1-+kp*t™. Raising to the pth power gives 
de™** =~ 14+(k+M) ( M is an integer divisible by p ) or = 1, mod p*t™*, 
Hence the truth of the proposition.* Furthermore, if p* is the highest power of p dividing 
d —1, then p**™ is the highest power of p dividing — 1; because, if mathematical 
induction is again employed, it will be observed that k would be prime to p while M is a multiple 
of p. 

The proof is similar for the even prime that if d = 1, mod 2°, (e > 0), then 2” =1, 
mod 2°+™, Similarly, if 2 is the highest power of 2 dividing d — 1, then d?” —1(m>0), 
is divisible by at least 2”**; and if 2°, e > 1, is the highest power of 2 dividing d — 1, then 
2e+m™ is the highest power of 2 dividing d?” —1, (m=O). 


* Cf. Dirichlet, Vorlesungen wiber Zahlentheorie (1879), p. 333. 
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From the preceding it is further evident that if r= p — 1, @ can always 
take a value such that n* = 1, mod p. Then when 8; is of order p”, or is s, 
t* s will likewise be of order p”; because (as has just been shown) its order 
divides p™, and its p”™'th power is 


in which (n%"* — 1)/(n* — 1) is divisible by p"™ but not by p™. [Note 2.] 
Next, in order to show that H is characteristic in G it will be proved that 
the cyclic subgroup of order p”™ generated by an operator of order p”, such as 
i* s, from the tail of G is not an invariant subgroup of G. (Note that in 7’ s, 
s is of order p™ and n* =1, mod p.) Transforming by ¢ should, if this 
subgroup is invariant in G@, give an operator commutative with 7s; 
i. e., = ts", which if commutative with requires that 
si* = ts" ts, or sits" = 8, or = But this cannot 
be satisfied since the rth power of ¢t is the first power of 1 commutative with 
s and since by hypothesis n — 1 is not divisible by p. Accordingly, H is the 
only invariant cyclic subgroup of order p™ in G and is therefore characteristic. 
Since any automorphism of G is determined by an automorphism of the 
characteristic subgroup H and some one of the p™ operators of order r (that is, 
t, ts, ts’, ete.), the J of G can be represented as a transitive substitution 
group on p™ letters. Now the J of G contaius an invariant cyclic subgroup E 
simply isomorphic with H. For, suppose v is an operator from the J of G 
leaving the operators of H invariant but transforming ¢ into ts, where s is 
of order p"™. Then vo" tv = ts gives v~* tv* = ts’, etc., from which it is seen 
that v is of the same order as s. Hence E is a cyclic subgroup of order p”. 
The characteristic subgroup H can be automorphic exactly as any cyclic 
group of order p”, because ¢t transforms every operator of H into the same 
power of itself and because the first power of ¢ appearing in H is the identity. 
Hence, the order of the I of G is equal to the order of H multiplied by the order 
of its own group of isomorphisms, which product is the order of the holomorph 
of H.* Furthermore, since the J of G contains an invariant cyclic subgroup 
of order p” and can be written transitively on p” letters, it is simply isomorphic 
with the holomorph of H. This last point can be proved directly by showing 
that operators effecting automorphisms of the characteristic subgroup H, 
transform the operators of the invariant cyclic subgroup E£ in exactly the same 
way. Since H is of odd order, the J of G has the interesting property of being 
complete, since “‘ the holomorph of any abelian group of odd order is a complete 
group.’’f 
2. If the order of the cyclic group H is 2p", H contains a characteristic 


* Miller, Blichfeldt, and Dickson, loc. cit., p. 46. 
t Miller, Mathematische Annalen, vol. 66 (1908), p. 135. 
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operator of order 2, furthermore G (formed by extending H by an operator 
from its group of isomorphisms which transforms every one of its operators 
of order > 2 into the nth power of itself where n = 1, mod p) would be the 
direct product of two characteristic subgroups having only the identity in 
common, one this group of order 2 and the other the non-abelian group formed 
by extending the cyclic subgroup (of H) of order p™. The J of this G is thus 
the holomorph of H .* 

3. Now a cyclic group H of order > 2 can be extended by an operator of 
order 2 which transforms its operators into their inverses, and the group of 
isomorphisms of the extended group is the holomorph of H. If h (which is 
the order of the group H) is a number > 6 and has primitive roots, evidently 
H can be extended by operators from its group of isomorphisms that will 
transform every operator of order > 2 into the same power of itself other than 
into its inverse. To have primitive roots h must be of the form 2, 4, p”, or 
2p™ (p an odd prime).t As the case h = 2 is trivial and h = 4 comes under 
the theorem on inverses, / will be considered to be one of the other forms; 
that is, h = p™ or 2p”. Then since a primitive root n of h (m > 1) isa primi- 
tive root of all powers of p,t{ the following may be based upon what has just 


been proved or can be proved independently: 

Corotuary. If a group G is formed by extending a cyclic group H whose 
order h = p™ or 2p™ by an operator from its group of isomorphisms which trans- 
forms every one of its operators of order > 2 into the nth power of itself where n 


is a primitive root of h, then the I of G is simply isomorphic with G , moreover 
G is complete if h = p™. 

Here n appertains to the exponent ¢(h), mod h; hence r, the order of the 
extending operator ¢, is ¢(A), and the order of G is h®é(h) so that G@ itself 
is seen to be the holomorph of H. 4H is the only invariant cyclic subgroup 
of order h in G and is therefore characteristic. If h = p™, Gis the holomorph 
of an abelian group of odd order which is characteristic in it, and hence G 
is complete.§ 

4. In case the cyclic group H is of order 2" (m > 1) and n = 1, mod 2, 
r, the order of the extending operator ¢ from the group of isomorphisms of H , 
divides ¢(2") = (and if m > 2, r is not greater than 2”-*).|| The 
attempt to establish a general theorem analogous to the preceding by a similar 

*Miller, these Transactions, vol. 1 (1900), p. 396. 

t Mathews, loc. cit., §§ 19-29. 

t Lebesgue, Journal de Mathematiques, vol. 19 (1854), p. 334; ef. also 
Dirichlet, loc. cit., p. 334. 

§ Burnside, Theory of Groups (1897), § 169; cf. Miller, Mathematische Annalen, 
vol. 66 (1908), p. 135. 

|| Weber, Lehrbuch der Algebra, Bd. II (1899), §17; cf. Burnside, loc. cit., also Miller, 
Bulletin of the American Mathematical Society, vol. 7 (1901), 
p. 351. 


336 LOUIS C. MATHEWSON [October 


method fails because ts (where s is a generator of H) is not of the same order 
as t, excepting in the case { st = s, when G is dihedral.* For, suppose 
t st = s" where the order of ¢ is 2“, so that m appertains to 2“, mod 2” 
(whence u < _m). It will be shown that ts is of order 2“ when and only when 
n = 2™— 1, which means that ¢ transforms the operators of H into their 
respective inverses so that its order is 2andu = 1. 
(is)™=i-f .-- st 
ne 
= 1(s")"-' = , which if equal to the identity, requires that 
(n™ — 1)/(m — 1) be congruent to zero, mod 2”. In Note 3 it is shown 
by number theory that this is true when and only when n = 2™—1. It 
may be observed, however, that by use of Note 2 it can be proved easily 
that the order of ¢* s° divides 2”. 

5. It is now an easy step from Theorem 1 to the following (in which p 
still represents an odd prime): 

THeorEM 2. If a group G is formed by extending an abelian group I of order 
p™, type (m1, M2, Me), My = M,mM = mM + M + + 
by an operator from its group of isomorphisms which transforms every one of its 
operators into the nth power of itself where n = 1, mod p, then the I of G is the 
holomorph of H and is a complete group. 


Nore 3. If n appertains, mod 2", to 2" (u >0), then a necessary and sufficient condi- 

tion that 
—1 

mod 2", is thatn = 2™—1. (It is supposed that m > 1, that n is odd and 0 <n < 2”). 

If n = 2™ — 1, then u = 1, and substituting shows that this value is sufficient to satisfy 
the congruence. 

It will now be shown that it is necessary that u > 1 and from this that n = 2™ —1. 
Let the highest power of 2 dividing n — 1 be 2. Nowe must = 1, for if e > 1, from Note 
2 the highest power of 2 dividing (n*“ — 1) /(m — 1) would be 2“, which is less than 2” 
since 2“ divides ¢(2"). Accordingly, e = 1 and n = 1 + 2k where k is odd. The require- 
ment that (n*“ —1)/(n—1) be divisible by 2” then requires n?“ — 1 to be divisible 
by 2™*1. Squaring n = 1 + 2k gives n? =1+k(k+1)2?; hence n* —1 is divisible 
by 2°*? (and by no higher power of 2) where 2° (b > 0) is the highest power of 2 dividing 
k + 1; and hence by Note 2, n** — 1 is divisible by 2°*“*! (and by no higher power of 2), so 
that the requirement that — 1 be divisible by 2”*! would necessitate b + u+lim+1, 
orb+u=m. Again by Note 2, if u > 1, n*“* —1 is divisible by 2°*“, and hence by 2” 
(because b ++ u=m). But this is impossible since n appertains to exponent 2“, mod 2”. 
Therefore u > 1, oruw=1. Then b=m —1, so that 


n=1+2k 


Since n < 2, c = 1 and accordingly n = 2™ —1. 


* Miller has shown that, if ¢ does not transform the operators of this H into their in- 
verses, the order of ts* is the least common multiple of the orders of ¢ and s*. These 
Transactions, vol. 4 (1903), p. 154. 
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The order r of the extending operator ¢ divides ¢(m,) and no operator 
(other than the identity) is transformed into itself. Evidently ts (where s 
is any operator of H) effects the same automorphism of H as does ¢ and is 
of order r. This as well as the following points can be proved by methods 
exactly like those in the proof of Theorem 1: the orders of the operators 
(1*s) outside H divide r if n* = 1, mod p, or divide p”, if n* = 1, mod p; 
if s is chosen of order p™, then the order of ¢* s may be p™ (n* = 1, mod p); 
that the cyclic subgroup of order p™ generated by such a ¢* s is not an invariant 
subgroup of G. Hence, the cyclic subgroups of order p™ in H which are the 
only ones of this order invariant under every operator of G, are characteristic 
in G (in fact they form a characteristic set*). Moreover, since no operator 
outside of H is commutative with each operator of a cyclic subgroup of order 
p™ in H, therefore H is the direct product of the only cyclic subgroups of 
order p™ that are invariant in G and the cyclic subgroups of G whose orders 
are powers of p and whose operators are commutative with the individual 
operators of the said cyclic subgroups of order p™. Thus H is characteristic 
in G, and the remainder of the proof proceeds as in the preceding theorem. 

6. From Theorem 2 a more general proposition in which H is an abelian 
group of odd order is easily deduced. The requirement that n = 1, mod p, 
can be made by stating that no operator (excepting the identity) is transformed 
into itself by the extending operator. It will now be shown that if a group G 
is formed by extending an abelian group H of odd order by an operetor from its 
group of isomorphisms which transforms every one of its operators into the same 
power of itself, not the first power, then the I of G is the holomorph of H and is a 
complete group. 

Let h = pi" py? --- p?*, where pi, «++, py are distinct odd primes and 
pi > Po > +++ > pe. We may take k& > 1 since Theorem 2 covers the case 
fork =1. Let Hy, ---, H;, be the abelian subgroups of orders pi", ---, pz, 
respectively. If pi", ---, p;* are the respective orders of the cyclic sub- 
groups of largest orders in H,, ---, H;, then the order r of the extending 
operator ¢ is the least common multiple of the exponents to which n appertains 
with respect to the moduli pi", ---, p;*, m being the power into which / 
transforms each operator of H. Let s; be an operator of order p™ in H; 
and let ¢ be made up of the commutative cycles t,, ---, ¢, where the order of 
t; is r; and where /, transforms s; into its nth power but is commutative with 
all the other s’s (2 = 1, ---,k); thent =f, + 

Evidently ts (where s is any operator of #7) is of order r, and the cycle ¢; s; 
(where s; is any operator of H;, 7 = 1, ---, k) will always be of order r; 
and r is the least common multiple of the r’s; and ¢s transforms H exactly 


*American Journal of Mathematics, vol. 38 (1916), p. 21. 
t Burnside, loc. cit., § 38. 
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as t does. It will next be shown that H is characteristic in G. First, H, 
is characteristic in G since the cyclic subgroup generated by s; is invariant 
under @ while any cyclic subgroup of order pj in G outside H would, since p; 
is the largest prime, necessitate that the cycle t{ s; be of order py (0 <a <r). 
According to the proof under Theorem 1 this cyclic subgroup generated by 
t{ s,; would not be invariant in G; -hence with argument similar to that in the 
proof of Theorem 2, H;, itself is characteristic in G. Next, H2 is characteristic 
in G, since the cyclic subgroup generated by s2 is invariant under G and 
each of its operators is commutative with each of the operators of H,. Any 
cyclic subgroup of order py? in G and outside of H would not be invariant, 
for by the proof of Theorem 1, if the cycle t: s. generated a cyclic subgroup 
of order p3*, this subgroup would not be invariant in G; also, if some 
power of ¢; s;-t2 8; (where s; is an operator of H, and s; is an operator of H;) 
generated a cyclic subgroup of order py*, evidently its operators would not 
be individually commutative with the operators of H, (since in such a power 
the first cycle would not reduce to the identity). The only operators (of 
G) whose orders divide p;* and which are commutative with the individual 
operators of H,; and those of the cyclic subgroup generated by s2, are operators 
of H,. Hence, is characteristic in G. Similarly, for H3, ---, and 
hence the proposition as stated. Or, according to the observation made in 
the introduction that requiring the particular extending operator to be “ from 
the group of isomorphisms of H ” is, here, only another way of saying that 
the order of the extending operator must be the same as that of the auto- 
morphism which it effects, the proposition may be stated as follows: 

THEOREM 3. If a group G is formed by extending an abelian group H of odd 
order by an operator t which transforms every operator of H into the same power 
of itself, not the first power, where the order of t equals the order of the automorphism 
it effects, then the I of G is the holomorph of H and is a complete group. 

From § 2 in which the case for h = 2p” is discussed, it is apparent that 
from Theorem 3 follows 

Corollary. If a group G is formed by extending an abelian group H whose 
order h = 2h’ where h’ is odd and > 1, by an operator t which transforms every 
operator of H of order > 2 into the same power of itself, not the first power, where 
the order of t equals the order of the automorphism which it effects, then the I of 
G is the holomorph of H. 

7. If H is an abelian group different from those of the preceding theorems, 
the following may be stated: 

THEorEM 4. If a group G is formed by extending an abelian group H whose 
order h = 2" h’ where m > 1 and h’ is odd and > 1, by an operator t which 
transforms every operator of H into the same power of itself such that no operator 
of odd order is transformed into its first power and such that the order of t equals 


q 
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the order of the automorphism of H which it effects and (a) the order of the auto- 
morphism of the operators of odd order which t effects is divisible by 2" where H 
contains an operator of order 2” but none of order 2"*', or else (b) t transforms 
into its inverse every one of H’s operators whose order is a power of 2, then the 
I of G is the holomorph of H. 

If H’ is the abelian subgroup of order h’ im H, it can be shown to be charac- 
teristic in G by a method analogous to that in Theorem 2. The operators whose 
orders are powers of 2 in H are the only operators in G which are commutative 
with the individual operators of H’. Hence H is characteristic in G. The 
product of ¢ multiplied into any operator of H effects the same automorphism 
of H as does t, and is of the same order ast. This latter statement about 
orders will be discussed. If ¢ is commutative with the operators of H whose 
orders are powers of 2, the statement is evident; if ¢ is not commutative with 
these operators, the cycle in ¢t which transforms these operators into their same 
powers is, by § 4, always of an order which divides 2". Hence part (a) of the 
theorem is proved. In part (0), if ¢ transforms each operator of H into its 
inverse, G is the generalized dihedral group; if t transforms every operator of 
H into the same power of itself and this transformation takes operators 
whose orders are powers of 2 into their inverses but operators of odd orders 
not into their inverses (and none excepting the identity into itself), then one 
cycle of ts (where s is any operator of H including the identity) is always 
of order 2 while the order of the other part is a constant > 2. Accordingly, 
as in the other theorems, the I of G may be written as a transitive substitution 
group on & letters and contains an invariant subgroup simply isomorphic 
with H; etc. 

8. One part of the preceding theorem is a special case of the proposition 
that if G is formed by extending an abelian group H which is the direct product 
of two groups H’ and H"” whose orders are relatively prime and where the order 
of H”’ is not divisible by 4, by an operator t which is commutative with the operators 
of H’ but which transforms every operator of H’ of order > 2 into the same power 
of itself, not the first power, and where the order of t equals the order of the auto- 
morphism of H’’ which it effects and is divisible by the order of each operator 
of H’, then the I of G is the holomorph of H. 

This proposition as well as the preceding theorems and Professor Miller’s 
theorem on the J of a generalized dicyclic group* are special cases of the 
following theorem, each being a case in which it can be proved that H is 
characteristic in G, that the J of G contains an invariant subgroup simply 
isomorphic with H, and that the subgroup H can be automorphic just as any 
abelian group simply isomorphic with H can. 


. Miller, Blichfeldt, and Dickson, loc. cit., p. 170; ef. also the theorem following the one 
on the dicyclic group. 
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TueoreM 5. If G is formed by extending an abelian group H by an operator 
t which transforms every operator of H into the same power of itself and which 
is such that ts; (i = 1, «++, h where s,, ---, 8, are the operators of H) are all 
of the same order and have a single characteristic operator of H for their same 
first power in H , then the I of G is the holomorph of H provided H is characteristic 
inG. 

The hypotheses here include the points established in the detail of the proof 
of Theorem 1, so that the last two paragraphs in § 1 give discussion sufficient 
for showing the truth of this theorem. 
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CONCERNING THE ZEROS OF THE SOLUTIONS OF CERTAIN 


DIFFERENTIAL EQUATIONS* 


BY 
WILLIAM BENJAMIN FITE 


In volume 42 of thee Mathematische Annalen Kneser has 
shown that the solutions of equations of the form y™ + qy = 0 oscillate 
an infinite number of times, provided that 2” q > k > 0 for sufficiently great 
values of x, where n = 2m > 0 and nis even. In case n is odd the solutions 
either oscillate an infinite number of times or approach zero together with 
their first n derivatives. In Section I of this paper the oscillation of solutions 
of certain linear equations of the second order is discussed and some results 
are obtained which are not included in Kneser’s. Linear equations of a 
certain trinomial form are discussed in Section II, and in Section III Kneser’s 
conclusions are shown to hold when m is restricted merely to being less than 
n. A condition of a slightly different nature; namely, that the integral 
J%,qdx diverge, is also shown to be sufficient to insure the oscillation of the 
solutions an infinite number of times, except possibly when n is odd. It is 
worth while observing that not only must the solutions vanish an infinite 
number of times, as Kneser says, but they must change sign an infinite number 
of times. Some of the conclusions of this section are also applicable to certain 
non-linear equations. Finally, in Section IV, the solutions of systems of 
linear equations are considered. It is shown as a particular case of a more 
general result that under specified conditions not all the functions which 
form a solution can vanish within an interval whose length is less than an 
assigned value. 


4 


Consider the equation 
(1) y” + py’ + qy = 0, 


where, for z= ™, |p| [= M, and g=h>O, M and h being constants 


such that 4h — M? > 0. 
If (1) had a solution y that vanished only a limited number of times for 
x > a, and if y’ were not negative or zero for all values of x greater than a 
* Presented to the Society October 27 and December 27, 1917. 
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certain 22, we could take a number ¢( > 22) such that y’(c) > 0 and con- 
sider the solution y2 of the equation 


(2) _ — My’ + hy =0, 


whose derivative vanishes for x = c.- We can choose 22 sufficiently great 
to insure that y. = 0 for a value a( < c) of x greater than the greatest root 
of y. If y% is the solution of (1) such that y,(a) =0, then y, (x) =0 
fora <a=c.* There is therefore a solution y3 of (1) such that y;(c) = 0 
and y3(2) = 0 fora=a<ec. We can assume that y(c) = y3(c). For 
values of x a little less than c, y(x) < y3(x). Therefore y(xz) = y3(2) 
for some value of x between a andc. But this would give a solution, y — y;, 
of (1) which vanished twice in (a, c). Since this is impossible, we can 
assume y’ (x) = 0 for all sufficiently great values of x. 
Consider now a solution y; of the equation 


(3) ~y"” + My’ + hy =0 


such that y,(¢) = 0, where is such that y’(c) <0. Now = 0 for 
some b greater than c¢ and the solution of (1) that vanishes for 2 = b has a 
derivative that vanishes in (c, b).¢ Hence if y; is the solution of (1) such 
that y;(¢) = 0, then y;(2) = Oforsomezin(c,b). We could then assume 
that y(c) = y;(c). But this would require that y(x) = y;(a) for some x 
in (c, b) exclusive of c. This is, however, impossible and we have therefore 


completed the proof of 

THeoreM I. [f in equation (1) p and q are continuous functions of x such 
that |p|= M and gq=h> 0 for every x = x, where M and h are constants 
such that 4h — M? > 0, then every solution changes sign an infinite number of 
times.t 

In his memoir on Linear Differential Equations of the Second Order§ Sturm 
considered the effect produced upon the zeros of the solutions of (1) by certain 
changes in the coefficients. These changes however were not of the kind 
considered in the following theorem since they did not involve a simultaneous 
increase in p and q.|| 

TuHeoreM II. in the equations 


(1) y’ + py +qy =9 
and 
(4) y’+ny +ny =0 


*Annals of Mathematics, vol. 18 (1917), p. 216. 

tAnnals of Mathematics, loc. cit., Theorem II, p. 216. 

t Cf. Kneser, Journal fir die reine und angewandte Mathematik, 
vol. 117 (1897), p. 80. 

§Journal de Mathématiques, vol. 1 (1836), p. 106. 

| Cf. Annals of Mathematics, loc. cit., p. 216. 
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P, fi, and q are continuous functions of x suck that,» =p=0,m 
and q > 0, for all values of # greater than x, , and if every solution of (4) vanishes 
an infinite number of times for these values of x, then every solution of (1) 
vanishes an infinite number of times. 

Let y2 be the solution of (4) that vanishes for x = a >2,. Then since 
by hypothesis y2 vanishes an infinite number of times for 2 > a, there is a 
number b( > a) such that y,(b) = 0. If now y is the solution of (1) that 
vanishes for x = a, then y’(c) = Oand y(c) > 0, wherea <c=b.* Now 
y’ (ce) = —q(e)y(e) <0. If y’’(x) remained negative for all values of 
x greater than c, y would vanish for some such z. If y’’(x) did not remain 
negative, it would become zero. Let d be the first value of x > ¢ for which 
(x) ="0. Then p(d)y’(d) +9q(d)y(d) =0. But p(d) =0 by hy- 
pothesis and y’(d) <0 since y’(c) = 0 and y’(x) =0 in the interval 
(ce, d). Hence this last equation is an impossible one, unless y vanishes in 
the interval (c, d). Therefore y(2) has one root, and hence an infinite 
number of roots, greater than a. 

The assumption in the theorem that every solution of (4) vanishes an 
infinite number of times for x > x is more restrictive than it need be. It 
would be sufficient to assume that for these values of 2 every root of every 
solution is followed by a root of the derivative of this solution. 

It is natural to inquire whether the restriction that p = 0 is a necessary 
one. As a partial answer to the question we observe that some restriction on 
p is certainly necessary, since no solution of the equation 


+ 3y' + 2y = 0 
vanishes more than once, although every solution of the equation 
y"+y +y=0 
has an infinite number of roots greater than any assigned number. 
If we change the dependent variable from y to 7 by means of the relation 
y = 27, where d is a constant, we get 


2n 
+(v+>)a+( +P 40)y=0. 


If now 


(5) 


Ap 
= 

+ +9750 
for x = 2, > 0, no g, and therefore no y, can vanish more than once for these 
values of . Various sufficient conditions for the non-vanishing of y for more 
than a finite number of values of x greater than 2, can be obtained from (5) 
by assigning particular values to). 


*Annals of Mathematics, loc. cit. 


if 
if 
| 
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II 
Consider the equation 
(6) + + = 0, 


where p and q are continuous functions of x in the interval considered and q¢ 
does not vanish. In counting the number of times a function vanishes each 
zero will be counted a number of times equal to its multiplicity. 

If y"» (x) = 0 and y(a) + 0, the sign of y™ (2x) is opposite that of 
q(x)y(2). Hence y cannot vanish twice within any interval within 
which y does not vanish, and therefore y’ cannot vanish n times within any 
such interval. 

If gq > 0 for x = x, a zero of y in this interval followed by a sequence of 
n — 1 zeros of y’ without a zero of y must be followed by a zero of y in case 
n is greater than 2. For y‘*~» has just one zero in this interval, say x = a, 
and must therefore be negative for values of x greater than a, since y™ (a) 
is negative. Moreover y‘*~? must have a zero which is equal to, or greater 
than, a, and for all values of x greater than this zero y“— and y—» would 
both be negative if y did not vanish. But this is impossible. The exception 
for n = 2 is an exception in fact, as may be seen from a consideration of the 
equation y” + 3y’+2y =0. The solution y = e* —e~™ has the single 
zero x = 0, and y’ = — e* + 2e-* vanishes for x = log 2. 

If g < 0 in an interval, a sequence of n — 1 zeros of y’ without a zero of y 
cannot be followed by a zero of y. For if this were possible y“~” would 
change from zero to a negative value while y™ is positive. But there must 
be an odd number of zeros of y’ between successive zeros of y. Hence if n is 
even and q < 0, there cannot be more than n — 3 zeros of y’ between successive 
zeros of y.* 

If n is odd and q does not vanish, there cannot be more than n — 2 zeros of 
y’ between successive zeros of y. If q > 0 for x = a and a zero of y in this 
interval is followed by a sequence of n — 2 zeros of y’ without a vanishing of 
y, y”” must change from a positive to a negative value, and therefore 
y"» must be negative when y‘"~ is negative. But y"-» cannot change 
from a negative value to zero while y is positive since when y“*~» is zero y™ 
is negative. If then y remained positive, y%—? and yy would remain 
negative. But this is impossible. Hence if nis odd and q > 0 forxz = ™,a 
zero of y in this interval followed by a sequence of n — 2 zeros of y’ without a 
zero of y must be followed by a zero of y. 

Suppose now that, for x = 2,, p and q are continous functions of x such that 


p =0,q=h>O0, and pis bounded. If for any value of x under considera- 


*For the case n = 2 see Bécher, these Transactions, vol. 3 (1902), p. 199. Cf. 
also Kneser, Journal fiir reine und angewandte Mathematik, vol. 
116 (1896), p. 181; Mathematische Annalen, vol, 42 (1893), p. 413. 
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tion y and y‘*~ are both positive, y“ must be negative. Then y‘*~” must 
decrease as 2 increases. If y does not vanish as 2 increases indefinitely and 
y"—» remains positive, it is possible to select values of x as great as we please 
for which the absolute value of y™ is as small as we please. Therefore y will 
be as small as we please for indefinitely great values of x. But y’ can vanish 
only a limited number of times for x = 2,. Hence y must approach zero. If 
on the other hand, y‘"~ is zero for some value of x for which y is positive, it 
will be negative for slightly greater values of x. But it cannot increase from 
a negative value to zero if y remains positive since when it is zero y™ is 
negative. And if it remains negative, y‘"~ must steadily decrease. If as x 
increases indefinitely y does not vanish and y‘*— does not become negative, 
it is possible to select values of x as great as we please for which the absolute 
value of y‘*~» is as small as we please. Then since p is bounded it is possible 
to select values of xz as great as we please for which py‘"~” is as small as we 
please in absolute value and y is positive. This requires that y come as near 
to zero as we please, and therefore that y approach zero as a limit. If y‘"~” 
should become negative it would remain negative and y would ultimately 
vanish. This completes the proof of 

TuHeoreEM III. If in equation (6), for x = x, p and q are continuous func- 
tions of x such that p=0,q = h> 0, and p is bounded, every solution either 
vanishes an infinite number of times or approaches zero as x increases indefinitely. 


III 


We consider now equations of the form 


(7) y + qy =0, 


where q is a continuous function of x for x2 2,. For any number a = 1 
and any solution y of (7) we have 


(8) y(x) 


n—1 


where r=, % =~@+060(x—a), and 0<@<1. If now the equation 


‘i —a)"* 
(9) y(a) + y'(a) (x —a) + +y™ 
is satisfied by a positive value of x — a, say b — a, then y(2) = 0 for some 
« between a and b, provided that g > 0. For if this were not the case we 
could assume y > 0 fora < x Sb, and if we put x = b in (8) the left member 
would be positive while the right member would be negative by virtue of (7) 


4 
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and (9). If, on the other hand, gq < 0 and y(a) = 0, the first root b of 
y(x) = 0 that is greater than a must be such that b — a exceeds the least 
positive value of x — a that satisfies (9). In particular, if no positive value 
of x — a satisfies (9), y cannot vanish for any value of x greater than a. 

We get immediately from (7) 


(z) = -f qydx + (x). 


If now y is greater than, or equal to, zero for x = 2, and g = 0, the integral 
J< qydx cannot diverge as zx increases indefinitely. For if it did, y~ would 
become negative, and, since y™ is by hypothesis negative or zero, y would 
become negative, contrary to the hypothesis. 

TueorEeM IV. Jf ¢q 20 when x = x and y is a solution of (7) such that 
the integral [* qydx diverges as x increases indefinitely, y must change sign an 
infinite number of times. 

In case g = 0 any solution that does not change sign an infinite number 
of times must become infinite if the integral diverges, since y‘*“-» becomes 
and remains greater than any given positive number. 

A direct application of Theorem IV shows that if the integral //, qdzx 
diverges there can be no solution y which exceeds a given positive number h 
for x = 2x,. Moreover if y does not change sign y’ cannot vanish more than 
n — 1 times. Hence any solution which does not change sign an infinite 
number of times must approach zero as x increases indefinitely. 

Now we have already made use of the fact that if two consecutive derivatives 
of y are negative or zero for all values of x greater than a given number a, y 
must ultimately be negative; and if two consecutive derivatives are positive 
or zero for these values of x, y must become positively infinite with x. More- 
over if we assume that y is not negative for x greater than a, y‘” must be 
negative or zero for these values of x. Hence if y“*~» were negative or zero 
for any such value of z it would be negative for all greater values and y 
would change sign. We must assume then that y“~” is positive when z is 
greater than a. If y~® were positive or zero for such a value of 2, it 
would be positive for all greater values and y would become infinite with z. 
We must assume then that y‘"~ is negative when 2 is greater thana. By 
continuing this argument we conclude that when n is even y’”’ must be negative 
for these values of x. On the other hand, the supposition that y approaches 
zero requires that y’ become negative and hence the supposition that y does 
not change sign for x greater than a is untenable. 

When n is odd this argument leads to the conclusion that if y does not change 
sign an infinite number of times, y’ must become and remain negative. But 
this is not inconsistent with the fact that y approaches zero as x increases 
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indefinitely,* as may be seen from the function y = e~, which is a solution of 
(7) when n is any positive odd integer and g = 1. If however y vanishes at 
all for z = 2, it must change sign an infinite number of times, since otherwise 
it would approach zero and y’ would change sign from positive to negative. 
This would require y’’ to be negative for some value of x greater than 2. 
It would then be negative for all greater values of x and y would change sign. 
This completes the proof of 

TueoreM V. If q > 0 and the integral qdx diverges as x increases in- 
definitely every solution of (7) must change’ sign an infinite number of times for 
x = 2% in case n is even, and in case n is odd such a solution must either change 
sign an infinite number of times or not vanish at all for x = 2x, .f 

Suppose now that we have 


z"q>h>0O 


for x =2,. This inequality will be strengthened if we replace m by m, 
where m > m,. We can so select m that 1/(nm — m) is an integer greater 
than 1, whatever value m, may. have less than n. If y did not change sign 
an infinite number of times or did not approach zero as z increases indefinitely, 
there would be a positive number /; such that y(2) > h, for sufficiently great 
values of z. We proceed to prove that this is impossible and that therefore 
y must either change sign an infinite number of times or approach zero as 
increases indefinitely. Moreover it follows from the proof of Theorem V 
that the latter can occur only under certain circumstances when n is odd. 
We take 2, sufficiently great so that y(a2) > h, for.2 = 2, and consider the 
consequences of the inequality 
(10; y(2) 


where k = hh, > 0. From (10) we get 


k 1 1 


y—» (a) would be negative for sufficiently great values of x and y would 
become negative. We assume therefore that if n > 1 


k 
(a) = (m — 2 0. 
l 


-* Tn this case y’, y”’, «++, y all approach zero. Cf. Kneser, loc. cit., vol. 42, p. 435. 
7 Cf. the result given by Kneser, loc. cit., p. 420. 
Trans. Am. Math. Soc, 23 
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Moreover if the expression in the left member of this inequality should become 
negative as 2, increases, we could apply the preceding argument for a properly 
chosen 2,. There remains then to be considered the case in which 


k 


forz =2,. From this inequality we get 


k 1 1 


1 
If n > 2 and y‘*~ (2) is to remain negative as x increases we must have 


(n—2) — 


Hence 


I: 1 1 
(n-3) «< (n—3) ( » 


Now if y does not change sign as x increases, no derivative of index 
n — 21 — 1 can become negative if all the derivatives with indices of the form 
n — 2}, where j =i, remain negative. There are therefore but two conceiv- 
able results of a continuation of this argument: 

(a) The derivatives of y from the nth to the first inclusive are of alternate signs 
forx =a,. In case n is odd this supposition leads to the inequality 


y (x) =(m —1)(m—2)-++(m =n) 


From this we conclude that y must become negative. 
When x is even the situation is not so simple. In this case we have 


k 
(11) (m —1)(m — 2) +--+ 
Hence 
—k 


y (x) —1)-+-(m—n+1)(m- — a") + y (1). 


There is therefore a positive number /, such that y(2) >) 2"™ for suffi- 
ciently great values of x, and we can replace (10) by 


y™ (2) (ky > 0). 


2m—n 


This in turn gives us y(z) > 1, (1, > 0), and we can accordingly 
replace (10) by 


ke 
y™ (21) < -=— (ke > 0). 


) = 0 
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A continuation of this process leads finally to the relation 


k, 


(rz) < (k, >0), 


n—1 


which in turn gives us 


k, x . 
y log —+ (m1). 


But this is contrary to (11). 

(b) The derivative of index n — 2i is positive for sufficiently great values of x, 
while all the derivatives with indices of the form n — 2}(0 Sj < 7) are negative 
for x = 2 and all the derivatives with indices of the form n — 27 +1(1 Sj S12) 
are positive for these values of x. 

Since y‘"~9 (a) is positive and increasing, we should have > 1, 
(1, > 0) for sufficiently great values of x. We could therefore replace (10) 
by 

(n) — fy 


But a series of steps similar to those described under (a) shows that this in- 
equality requires that y“*~?**» (x) be negative for sufficiently great values of x. 
This however contradicts our assumption. 

This completes the proof of 

THeorEM VI. in the equation 


y + qy = 0 


q is a positive infinitesimal of order less than n with respect to 1/x(x =m), y 
must change sign an infinite number of times as x increases from 2,, in case n 
is even. If n is odd y must change sign an infinite number of times as x 
increases from x, or not vanish at all for x = x2. 

The restriction that m < n is essential since such an equation as 
15y 
for example, has y = 2°/? for a solution. 

Some of the foregoing results can be extended to certain non-linear equa- 
tions of the form 


(12) y™ + f(x,y) =0. 


Suppose that f (2, y) has the following properties: 

1. It is continuous for z and y finite and z = 2. 

2. When x = 2, it is positive or negative according as y is positive or 
negative. 
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3. For any x (= 2) and any two finite values of y, as y, and ye, 
If(a,m) —f(2,y2)|< Ala — wl, 


where A is a constant. 

No solution has a singular point for any finite value of = 2,. If a solu- 
tion y does not change sign an infinite number of times as z increases from 2, 
it must ultimately either increase monotonically or decrease monotonically, 
since y’ cannot vanish more than n — 1 distinct times within an interval 
within which y does not vanish. But y can approach zero monotonically 
only in case it does not vanish at all for x = 2, and nis odd. A repetition 
of the analysis used in the proof of Theorem IV shows that, if y is such that 
the integral f;, f(x, y)dx diverges as x increases without limit, it must 
change sign an infinite number of times. 

IV 

In thee Annals of Mathematics for June, 1917, I have estab- 
lished the existence of a minimum length for an interval within which any 
solution of a linear homogeneous differential equation of order n and its first 
n — 1 derivatives can all vanish. This raises the question as to the existence 
of such minimum intervals for the solutions of more general classes of equa- 
tions. I show here that these minimum intervals do exist for systems of 
equations of the first order subject to certain restrictions. In fact a some- 
what more general result is established from which the existence of the intervals 
in question can be inferred as a special case.* 

Consider the system of equations 


dy 


= 


where the f’s are continuous functions of their arguments when z is in the 
interval (a, b) and y,, y2, «++, yn are finite. Moreover the inequalities 


* This more general result was not contained in the paper presented to the Society. It 
was suggested to me later by Dr. J. F. Ritt. 


dys 
(A ) dx 
di 
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where I,, Lz, ---, LZ, are certain positive constants, are supposed to be 
satisfied for any finite values of y, and 2x. 

Let 71, Y2, ***, Yn and 2%, 22, 2, be two solutions of these equations 
such that each of the differences y; — z, vanishes somewhere in (a, 6). 
Picard’s method of successive approximations, which is applicable to this 
case, shows that the functions y; and 2; remain finite within (a, 6). The 
difference y; — z; has therefore a maximum absolute value which we shall 
represent by M;. Now 


d 2) 
= lf. (x, Yn) —fi(z,%, 2, 2n)| 


n 
< Lil ye — z|. 
k=1 
-Moreover y; — z; = 0 somewhere within (a,b). Hence 


M;< Mi: p, 


where b — a = p. 

If in this inequality we give to 7 the values 1, 2, --- , min turn and multiply 
the resulting inequalities by [,, 2, ---, Ln respectively, and add, we get, 
after dividing by Mz, 

1< P; 


or 


1 


We have therefore 

TuHeorEM VII. If the corresponding functions of two solutions of equations 
A are equal somewhere within the interval (a, b), the length of this interval must 
exceed Ly. 


If we assume further that the functions f; are such that 
fi(z,0,0,---,0) =0 


within (a, 6), then 0,0, ---,0isa solution of A and we can apply Theorem 
VII to this and the solution #1, y2, ++, yx. This leads to the conclusion 
that if p= no non-identically vanishing solution y:, Ye, Yn Of 
equations A in which f,(x, 0, 0, ---, 0) = 0 within an interval (a, b) of 
length p can be such that all the y’s vanish within (a,b). 

This conclusion can not be drawn from the conditions of Theorem VII, as 
may be seen from a consideration of the system 


dye 


i 
3 
n 
= 
q 
4 
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This system satisfies all the conditions of the theorem for any finite interval, 
but not the condition that f;(2, 0,0, ---,0) =0. On the other hand, it 
has the solution 


= (Gel® + — (12e-1/3 + — Gz, 


Yo = — + 4+ + 4 + Oe + 9 


and y and y2 both vanish when x = — 1. 
Analogous results hold in case the f’s are analytic functions of the complex 
arguments x, Yi, Y2, ***, Yn When 2 remains within the circle of center 2 
and radius r (or upon its circumference) and #1, y2, -**, Yn are finite, subject 
to the condition that for any fixed k from 1 to n inclusive and every 7 from 
1 to n inclusive af 
| Of; | 
OY: | 


k 


for these values of the argument. 
CotuMBIA UNIVERSITY 


DIFFERENTIATION WITH RESPECT TO A FUNCTION OF LIMITED 
VARIATION* 
P. J. DANIELL 


Lebesgue, Radon, and Youngf have defined integrals with respect to a 
function of limited variation, and these are generalizations of the Stieltjes 
integral. The next step which suggests itself is a definition of the corre- 
sponding derivative. Such a definition is given in this paper, and the funda- 
mental property of a derivative is proved by means of a modification of 
Vitali’s theorem. The steps taken are parallels of steps given by de la Vallée 
Poussint{ in the theory of Lebesgue plane sets. Since this paper was first 
written, a paper by Young has appeared, giving a slightly different definition 
of the derivative, and an entirely different trea@ment. The applications at 
the end of this paper are not given by Young. 

Definition of derived numbers and derivative. Consider two functions of 
x, F(x), defined in the fundamental interval,0 =zx=1. The ratio 


AF 


a(r+e)—a(a—e) Aa 


may have upper and lower limits as e approaches 0. We define these as the 
upper and lower derived numbers of F (2) with resp:ct to a(a), and we use 
the notation 


—— AF AF 
= j = 
D, F(x) lim (2) im 
For x equal to 0 or 1 it is necessary to add a convention whereby F(x), 
a(2) are continued beyond the range (0, 1), and have values equal to their 
values at 0 or 1 respectively. If the two derived numbers are finite and equal, 


* Presented to the Society, September 6, 1918. 
+ Lebesgue, Comptes Rendus, vol. 150 (1910), p. 86; Radon, Wiener Sitz 
ungsberichte, vol. 122, section 2a (1913), p. 1295; W. H. Young, Proceedings 
of the London Mathematical Society, vol. 13 (1914), p. 109. 

t De la Vallée Poussin, these Transactions, vol. 16 (1915), p. 435. 

§ W.H. Young, Proceedings of the London Mathematical Society, 
vol. 15 (1916), p. 35. 
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that is, if the relative change ratio AF/Aa possesses a finite limit, this limit 
is called the a-derivative of F (a2), and is denoted by 

(2). 


An interval (2 — €, x + €) we shall call a central interval with center at 2. 
If a(x), F(x), are functions of limited variation, we can define absolutely 


additive functions of sets measurable Borel 


fda, ar, 


and also the corresponding modular integrals, denoted according to Radon by 


lar, 
E 


or according to the writer by 
dw, dQ, 
E 


where w (a2), 2(2), are the variation functions corresponding to a(a2), F(z). 
These modular integrals are additive, finite, and non-negative. 

Absolute continuity. A function F (2) is said to be absolutely continuous 
relative to a(2), a function of limited variation, if given any positive € we 


can find 6 so that 
f <4, 


for all sets e measurable Borel such that 


f do (z) <6, 


where w(x), 2(2) are the variation functions corresponding to a(x), F(x). 
We desire to prove the fundamental proposition: 

Tueorem. If F(x) is absolutely continuous relative to a(x), it possesses 
a finite a-derivative nearly everywhere (w) [that is, except for a point set e for 
which {,dw(x) = 0]; this a-derivative is summable (a) where it exists, and if 


E is any set measurable Borel, 


f ar z) >. F(x)da(zx). 


In this, D, F (2) denotes the a-derivative of F(x) where it exists, and any 
finite value where it does not. 


=z 
4 
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Before we can prove this proposition, it is necessary to prove two lemmas. 
Lemma 1. Given any positive € and a non-decreasing function w(x), any 
set E measurable Borel can be enclosed strictly in a finite or denumerable system 
A of disjoint intervals (that is, intervals with no points common to any two) in 


f de < +e. 
vA E 


Let D be the set of points at which w (2) has finite discontinuities; then 
D consists at most of a denumerable set of points. Resolve w (a) into w,(2), 


such a way that 


a continuous non-decreasing function, and w. (2), a non-decreasing function 
which is stationary except when z passes a point of D. The integral [dw (2) 
will be a continuous additive non-negative function of sets. Hence,* given 
any positive €, we can enclose E£ strictly in a finite or denumerable system A; 
of disjoint intervals so that 


f dw, (x) < f + de. 


1 


The set CE-D consists at most of a denumerable set of points D’. Then, since 


f doce) == des 


is a convergent series of positive terms (considered as the sum of the dis- 
continuities of w(2) at the denumerable set of points D’), we can choose a 
finite n and the finite set of points D, so that 


dw (x) <f dw. (x) + de. 


Dy 


From A, cut out the points belonging to D,, which are finite in number. 
Then A = A,-CD, still forms a denumerable system of intervals enclosing 
E strictly; for the points D, belong to CE. The set A is the same as A; 
except for the exclusion of a finite set of points, or 


f dex (2) = doy (2) < f des (2) + he. 


Again, from the way in which D, was chosen, 


| dwe(x) < 
CE+D-+ CD, 


= () 


* De la Vallée Poussin, loc. cit., p. 470. 


4 
4 
4 
4 
q 
i 
54 
{| 
ii 
A 
f 
i 
But 
4 


356 P. J. DANIELL [October 


over any set e not containing points of D. Hence 


dw. (2) < te, dw. (x) < 
CE+ A, + CE+ 


and, since A = A;:CD,, 


f (2) < de, f < f de (2) + le. 
f do(e) = dos + f den (2) < f doce) 


The lemma is proved. 

The following lemma is a generalization of Vitali’s theorem.* 

Lemma 2. Given a set E measurable Borel, and an infinite family T of cen- 
tral intervals, such that each point of E is the center of an infinity of central 
intervals as small as we please; then a set B can be found consisting of a finite 
or denumerable number of disjoint intervals chosen from T, such that B covers 
nearly all E (that is, except for a point set of w-measure 0) and such that the 
w-measure of B differs from that of E by as little as we please. 

That is to say, given any positive e, we can find B so that 


= 0, f < f + 


By means of Lemma 1, given any positive e, we can enclose E strictly in a 
denumerable set of disjoint intervals A , so that 


ff < f de +e, 
BE 


where w’ (x) = w(x) +2. In what follows we denote the w’-measure of a 
set E simply by mE. Then mE is the sum of the w-measure and the usual 


Hence 


Lebesgue measure. The intervals A are not necessarily central intervals. 
From the family T eliminate the intervals which have points in common with 
Pp 

CA. The remaining family I, will possess the same property relative to FE, 
for each point of E is interior (strictly) to one or other of the intervals A. 

We affirm that with a finite number of disjoint intervals of T; we can cover 
a part e, of E such that me; > kmE, where k is any number less than one-third. 
Fort let the set CE be enclosed strictly in a denumerable set D of open intervals 
so that mD is arbitrarily close to mCE. Then CD is a closed set contained 
in E,, and given any positive e we can make mCD > mE — ¢«. By the Heine- 

* See de la Vallée Poussin, Cours d’ Analyse, second edition, vol. 2, p. 110. 

t This part of the proof is due to the Editors of these Transactions. 
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Borel Theorem* CD can be covered by a finite number of intervals E, chosen 
from the family T,;. Then 


mE, = mCD > mE — e.* 


From these intervals choose first that one having the greatest w’-measure, 
and e!iminate those which have any point in common with it. Next choose 
the remaining interval which has the greatest w’-measure, and so on. After 
a finite number of such steps we shall have chosen a finite number of intervals, 
and the process will terminate. Each time intervals are eliminated, the 
w’-measure of the interval retained will be at least one-third of the w’-measure 
of the interval covered by it and all intervals eliminated as overlapping it. 
Hence the measure of all intervals retained will be at least }mE,,, or will be 
greater than 3mE — e. The part of FE not covered, that is E — e,, is there- 
fore of measure less than mA — 4mE + e, or less than 3mE + 2e. Hence 


me, > — 2e. 
Our affirmation is proved, that with a finite number of disjoint intervals 
belonging to I’; we can cover a part e; of E so that 
me, > kmE, 
where k is any number less than one-third. 

After we have thus chosen e,;, omit from T those intervals which overlap 
e,, and let I, be the remainder. Then I, will have the same properties rela- 
tive to E — e, as T, has relative to E. We can by the same process cover a 
portion of — e;, such that 


me, >km(E —e), 


by a finite number of intervals of T,. Continuing the process, we obtain a 
system B of disjoint intervals of the family T., which are at most denumerable. 
Moreover the set B-E = De,. But 


me, > k(mE — me; — — men), 


and the term in the parenthesis is non-negative, and therefore > me, is con- 
vergent. Thus me, approaches zero, or 


m(>X én) = X me, = mE. 


Then 


mE-B=mE, mE-CB = 0, dw’ = 0. 


But w’ (2) = w(x) + 2, so that 


* Cf. de la Vallée Poussin, Intégrales de Lebesgue, pp. 13-15. 
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{ dw = { dw’ = 0; 

E+CB E+CB 

dw = dw’ < f dw < 
A-E A-E A E 


But B is contained in A, or 
f dw < [ do +e. 
B 
The lemma is proved. 


To return to our original proposition, let us prove it first in the case where 


and, since 


a(x) is a non-decreasing function, so that a(x) = w(2). 
_In any set e, if D, F(x) 21 (this inequality being considered to hold if 
D, F(x) = + «), we shall show that 


farz=if de. 


When this is proved, it will follow as a corollary, and can also be established 


directly by parallel reasoning, that a similar conclusion holds if the signs of 
inequality are reversed, or if D is replaced by D, or both. At any rate, it is 


sufficient to give the proof for the case first mentioned. Corresponding to 
every point in e (which is measurable Borel) we can find an infinity of central 
intervals as small as we please, such that for each 


given any positive e’. These form a family I having the Vitali property 
relative to e. Since F(2) is absolutely continuous with respect to w(x), 
given any positive ¢ we can find 6 so that 


f ao <€, 
de <6, 


Using Lemma 2, we can define a set B consisting of a denumerable system of 


for all sets e for which 


disjoint intervals belonging to I’, such that 


f dw = 0, 
e+CB 
f dQ 

e+CB 


whence 
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and such that 


whence 


Then 


ff do f de <é, far <e. 
e B e ~/B 


But for each of the intervals B, 


AF > lAw — far >tf de, 
Aw B JB B 


far > (1 f do — 6. 


In the limit, as €, e’ approach 0, 6 also approaches 0, and 


farzif de. 


Since {. dF is limited, the w-measure of e decreases to the limit 0 as / in- 
creases indefinitely. Also, since F is absolutely continuous with respect to w, 
J. dF will also approach the limit 0. Thus the set of points for which 
D,, F(x) = + © is of w-measure 0. A similar proof shows that the set for 
which D, F (2) = — © is of w-measure 0. 

Take two finite numbers m, M, and divide the interval between them into 
sub-intervals by <1, < +++ <l, = M, where max |l; 
given positive 7. Let e; be the set in E for which /;_, = D. F <1;; then 


dw nf do. 


By summing up for the sets ¢;, if E’ is the set of points where m = D, F < M, 
g up p 


far f dol sn de 
in the limit, as 7 approaches 0, D, F is summable (w) in E’, and ql 
dF = D, Fdw. i 

E’ 


dw < 4, 
3 Be Ce 
3 dQ <e. 
B+Ce 
| 
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It has been proved already that fy dF approaches 0 as M increases and m 
decreases indefinitely, whence D, F is summable (w) in E, and 


dF D, Fdw. 
E E 


Similarly it can be proved that D, F is summable (w) in E, and 


dF Fdw. 
far = D, Faw 


But D, F = D, F, or at every point of E except on a point set of w-measure 
0, 
D,F=D F=D,F, 


and F (2x) has a finite derivative with respect to w (2) nearly everywhere (w) 


in E. Also 
f ar =|». Fdw. 
E E 


More generally, if a(a) is a function of limited variation, it is absolutely 
continuous with respect to its variation function w(2). We may split any 
set E measurable Borel into two subsets £,, E2, so that if e is a variable set, 


f da = f dw, 


a non-negative function of sets, and 


f da = -f dw. 
e+ e+ Ee 


We have already proved that 


far = = { D. Fda, 


and by the same proof D,, @ exists and is finite nearly everywhere (w), and 


f dw = { da -{ D, adw; 
Ey 


whence D,, a = lim Aa/Aw exists and equals 1 nearly everywhere (w) in £;. 
It follows that D, F exists and equals D, F nearly everywhere (w) in E, ; or 


ar = { Fda. 


Similarly in E,, D, F exists and equals — D, F nearly everywhere (w), and 
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= [ D. Faw = fp. Fda = { D, Fda. 


By combining F, and Ez, it is proved that D, F exists and is finite in E except 
for a set of w-measure 0, that it is summable in EL, and that 


f ar(z) = D.F(2)da(z). 


: Applications. Reduction of general integral to integral of positive type. Let 
g(x) denote the function 


g(x) dD, a(x), 
where this derivative = + 1 or — 1, and 
g(x) =0 


otherwise, that is to say on a set of w-measure 0. On the set Ei, 9(2) = 1 
nearly everywhere (w), whence, if f(x) is any function summable (w), 


ff = ff = f 


Similarly, 


J = f = f 


whence 


This proves that any integral with respect to a function of bounded variation 
a(x) can be expressed as a single integral of positive type, that is to say, 
an integral with respect to a non-decreasing function w(x). 

Integration by Parts. Let F(x) = (a2), where a(x) is a function of 
limited variation. The latter function may be expressed as the difference of 
two that are non-decreasing, 


a(x) = — 


whence 
F(x) = a? (x) = Bi (xr) + B3(x) — 28, (x) 
Q(x) = Bi (x) + + 281 (x) Bo (x) = w (2x). 


The function w(x) is limited and less than some finite number K, so that 
for any interval 


AQ < 2K Ao ; 


ai 
a 
; 
if 
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whence F (2) = a?(x) is absolutely continuous with respect to a(2). 


this case 


AF a?(x+e) —a*(r—e) 


Aa ~ a(a+e) —a(a—e) = a(z +e) +a(2—¢). 


Let a(x) denote 


lim 3 [a(x + €) + a(z—e)]; 


then 
D, a? (x) = 2a(2). 


By the fundamental proposition, 


face (a) = 2a (2)da(e). 


Let a: (x), a(x) be two functions of limited variation; then 


fata + a2 )?] f dad + f dai +2 f dar as) 


In 


f a2) = f + 
E E JE 


Transposed and written more fully, the last relation becomes 


ff = [atau (e)ax(2)) f (2). 
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1. INTRODUCTION AND NOTATIONS 


It is a well-known fact that linear integral equations of the first and second 
kinds may be regarded as the limiting cases, as n becomes infinite, of systems 
of n linear algebraic equations in n variables. 

The same idea of passing to a limit suggests that one treat the integro- 
differential equation 


B \ 


as the limit of a system of n linear differential equations of the first order 
of the formt 


* Presented to the Society, December 28, 1917. The problem treated in this paper was 
first suggested to me by Professor W. A. Hurwitz, to whom, and to Professor M. Bécher, 
I tender my grateful acknowledgment for constant help, suggestions, and criticisms. 

+ For the system (a) when all the equations are homogeneous, a different integro-differentia] 
equation was obtained by Schlesinger (Jahresbericht der Deutschen Mathe- 
matiker-Vereinigung, vol. 24 (1915), p. 84) by means of a process involving 
certain changes of the form of the equations (a) before passing to the limit. The equation 
thereby obtained differs from (A ) in that the variable z is complex and all functions involved 
are analytic functions in z, that the functions u and \ contain another variable r of the same 
class as s, and that 


¢(2,8s)=0 and 
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+ (a) + + hin(t) =A (2), 
(a) 
dun (x) 


dx 


We shall have occasion to adjoin to (A) a boundary condition of the type 


a(#)u(a,#) + B(s)u(b,s) + f [A(s,r)u(a,r) 
+ B(s,r)u(b,r)]dr = y(s). 


This we shall call a two-point boundary condition since it involves the two 
values a and b of the variable x. This is obviously the limiting form of the 
system of linear boundary conditions usually attached to the ‘finite differ- 
ential system (a), as we let the number of equations increase indefinitely. 

Throughout this paper, all variables entering will be real. These variables 
may be conveniently divided into two classes corresponding respectively to 
the first and second arguments of the unknown function wu in the equations 
(A), (B). The first class of variables is denoted by such letters as x, y, z, 
—£,m, ¢, and they take on the values in the closed interval 


(B) 


q: aezz=b. 


We shall speak of this in the future simply as the interval J, , the subscript x 
indicating the variable referred to. 

The second class of variables is usually denoted by the letters s,t,r,0,7,p, 
which take on the values in the interval 


J: 


In the case of functions of two or more variables, each of which is confined 
to one of the intervals J and J, we interpret the different variables as rect- 
angular coérdinates. For instance, the unknown function u(x, s) will be 
supposed to be defined in the rectangle 


I,J: 


In case the variables belong to the same class, we shall have square regions 
I,,orJ,,. Likewise, for functions involving more than two variables we have 


Then Schlesinger considered also the associated homogeneous equation of the type (A) 
whose solutions are made dependent on the solutions of the former equation. These equations 
were also treated in a similar fashion in the notation of general analysis by T. H. Hildebrandt 
(these Transactions, vol. 18 (1917), p. 73). [After the manuscript of the present 
paper was in the hands of the editors of the Transactions, I was informed by them 
that a second paper by Hildebrandt was to appear shortly in the Transactions. See 
vol. 19 (1918), p. 97.] 
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such regions as I,,J5, Izy Js, ete. All these intervals and regions will be 
understood to be closed. 

To simplify our work, we shall assume, unless otherwise stated, that all 
functions considered are real and continuous (and therefore bounded) in the 
respective regions in which they are defined. By a solution of the equations 
(A), (B), or any other equation under consideration, we understand, with- 
out further specification, a continuous function. A solution of the equa- 
tions (4), (B), possesses a continuous first derivative with respect to its 
first argument. A solution which is identically zero will be termed a trivial 
solution. 


2. Tue INTEGRO-DIFFERENTIAL EQUATION 


The integro-differential equation 
0 
may be reduced, by means of the transformation* 


‘x 
s)dé 
u(a,s)=e 


where y is regarded as a fixed point in J,, to the equation 


Xo(2,t)dt = J. 8). 


This equation is the special case of (A) in which the second term of the first 
member is lacking. Let us write for convenience 


(2) n(?*) _ eG, 
so that 


Changing z in (1) into £ and integrating from y to x, we find 


v(x, 8) =0(y,8) +f 


-[ fv (1°) oe, 


* This was pointed out to me by Professor Birkhoff. 
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We may now transform back to u(2, s), getting 


This is a special case of the equation 


Let us then consider (4). 

The function @(¢%) (called the kernel of the equation) will be supposed 
to be continuous in J,;J,:. In its appearance, the equation is intermediate 
between the Volterra and the Fredholm types; but it behaves like an equa- 
tion of the Volterra type because of the variable limit of the first integral. 
Since Volterra’s method may be applied almost word for word,* we shall 
give here only the results. 

We are led by the method of successive substitutions to the consideration 
of the series 


(27) 


The series (5) converges absolutely and uniformly in I,; J:, thus representing 
a bounded continuous function, 6 (71), which shall be called the resolvent 
function of the kernel 6 (77). 

The kernel and the resolvent function satisfy the resolvent formule 


We now readily establish the 


(A’) 


* See Volterra: Lecons sur les équations intégrales, p. 74, where an equation is treated which 
is identical with (4), except that 8 is replaced by the variable s. The possibility of using the 
same method when one of the upper limits is constant was pointed out to me by Professor 
W. A. Hurwitz. 
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Lemma. If is continuous in and f(x, 8) is continuous in 
I, J, then the equation (4) has one and only one solution, namely 


Returning now to the equation ( A’), let 


where y is regarded as a fixed point in J,. Thus we obtain by the lemma, for 
each assigned function u(y, s), a unique solution of (A’) or (A), which, 
if we let 


may be put into the form 


u(z,8s)= +f oa 


rs 6 rs 
Hence, we have 


THeorEM I. The integro-differential equation (A) possesses one and only 
one solution which. reduces to the assigned initial function u(y, s) at the fixed 
point y in I,; this solution is given by the formulo (11). 

I. [If the integro-differential equation (A) is homogeneous, 2. e., 
if \(a,s) = 0, the solution has the form 


(11) 


II. The function 


is a particular solution of the non-homogeneous equation (A), corresponding 
to the initial function u(ys) =0. 

Observe that the integrand of the expression w(j*), when regarded as a 
function of x and s, is a solution of the homogeneous equation (A) for each 
constant value of €. Thus the particular solution w(;*) of the non-homo- 


368 MINFU TAH HU [October 


geneous equation (A) is built up from the solutions of the homogeneous 
equation by an integration. It is clear that every other solution of the non- 
homogeneous equation is obtainable by adding to the particular solution w a 
solution of the homogeneous equation. 

We shall also have occasion to apply the following: 

Corotiary III. The function @(}1), when regarded as a function in x 
and 8, is a solution of the homogeneous equation (A) , corresponding to the initial 
function 8 (41) = —W(,yi)aty. 

On account of the resolvent formula (6), we have 


ys\ (ys). 


and, on account of the first formula (9) and formula (2), 


ys\ 8 
(15) ¥(,1): 


Consequently, by combining (9), (14), (15), 


o(**) 
yt y yt 


and because of (10) the equation (6) becomes 


B 
)+ fo (27) ae, 
yt y J. \yo yt 


which is a solution by Corollary I. 


3. THe BounpaRY PROBLEM 
Let us now take a linear integral boundary expression of the following type: 
Ulu] =a(s)u(a,s) + B(s)u(b,s) 
(1) 8 
+f [A(s,r)u(a,r) + B(s,r)u(b,r)]dr, 


where a(s), 6(s8) are continuous functions in J,, A(s,r), B(s,r) are 
continuous functions in J,;, and a, b are the end points of the interval J,. 
Let us write from now on 


Ou(x, 8 


We shall consider the integro-differential boundary problems 


> 

3 
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(A) (B) Ulu] =y(s) 
and 
(Ao) Li[u] =0, (Bo) U[u] =0. 
It has been seen that all solutions of the non-homogeneous and the homo- 
geneous equations (A), (Ao) are of the forms 


8B 
(3) =e + f tat, 


rs 6 rs 
(4) = f 8 (21) tat, 


respectively, where y is a fixed point in the interval J, at which the initial 
function w(y,s) is to be assigned. Both y and the continuous function 
u(y,s%) are arbitrary. But in order to satisfy the boundary condition, it is 
clear that the initial function must be suitably chosen. 

Substituting in (1) the value of u(2z, s) from (3), we find that the boundary 
condition (B) reduces to 


B 
(5) a(t) uy ta = 


where 


9(y,8) = 


(6) 


a(,1) A(s,t)R("') + + 


This is an integral equation for determining the initial function u(y, s). 
Likewise, the equation (By) of the homogeneous system reduces to the 
homogeneous integral equation 


B 
@) + f u(y, tat 


Now we impose the further condition that a(s) and 8(s) be such that 
g(y, 8) do not vanish at any point of J,, so that the equations (5) and 
(7) may be reduced to integral equations of the second kind. Let us, 
then, examine this condition a little further by allowing the point y in the 
expression g(y,s) to vary inJ,. Now 


If g(y, s) + 0 for a particular value of y, then 


— J" 


a(s) + B(s)e + 0, 
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i. €., 


a(s) + +0. 

Conversely, if (C) is fulfilled, then we shall have g(y, s) + 0 throughout J, 

for each value of y in J,. Thus the condition (C) and the condition g (y, ) 

+ 0 are equivalent conditions, but it should be noticed that condition (C) 

does not involve y. Hereafter we shall always assume that (C) is fulfilled. 
Under the condition (C) the equations (5) and (7) become 


B 
(5’) u(y, 8) = Flys) +f K( )ucy, tat, 


7" u(yse) =f trae, 
where 
K( 


yt) 


g(y,8) 


F(y,8) 


The problem of solving the system (A, B) or (Ao, Bo) then reduces to the 
determination of the initial function u(y, s) from the equation (5’) or (7’). 
The initial function so determined will give the solution of the system upon 
substituting into the equation (3) or (4). 

As in the theory of differential equations, the homogeneous system (Ao, Bo) 
is said to be incompatible if it possesses no non-trivial solution; it is said to 
have compatibility of the kth order or index k if there are precisely k linearly 
independent solutions. 

Suppose w(2,s), «++, Un(2, 8) are linearly dependent solutions of the 
homogeneous system (Ay, By). Then there exist constants c,, «++, ¢,, not 
all zero, such that 

C1 U1 (2,8) + en (2,8) = 0 


identically in J, J,; in particular, 
C1 U1(y¥,8) tenuin(y,s) =0 


for a particular value y in J,. Conversely, if the initial functions u;(y, s) 
are linearly dependent, the solutions of the system formed by means of (4) 
will be linearly dependent. Hence 

TuHeoreM I. Under the condition (C), a necessary and sufficient condition 
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that the homogeneous system (Ao, Bo) have index k is that the integral equation 
(7’) have index* k. 

THEOREM I’. A necessary and sufficient condition that the system (Ao, Bo), 
subject to the condition (C), be incompatible is that the Fredholm determinant 
D(y) of the kernel K (,') of the equation (7’) does not vanisht; if D(y) van- 
ishes, the index of the system is finite. 

Since the condition (C) does not depend on y, the kernel K(,‘) of (5’) 
and (7’) and the Fredholm determinant D (y) exist for every y in I,. Conse- 
quently, if D(yo) + 0, the homogeneous system (Ao, Bo) will be incom- 
patible and therefore the equation (7’) can have only the trivial solution 
u(y,s) =O for every y. Hence 

THeoreEM II. The Fredholm determinant, D(y), of the equations (5') and 
(7’) either vanishes everywhere in I, or else vanishes nowhere. 

Every solution of (Ao, Bo) becomes, when z is changed into y, a solution 
of (7’). Conversely, however, a solution of (7’) will not, in general, when y 
is changed to x, become a solution of (Ao, Bo), as is seen by the fact that a 
solution of (7’) may be multiplied by an arbitrary function of y while a solution 
of (Ao, By) cannot be multiplied by an arbitrary function of x. Consequently, 
(Ao, Bo) is not, in general, equivalent to (7’). It is, however, equivalent in 
the special case in which (Ao, Bo) is incompatible, since then (7’) also has 
no solution except zero. In this case (A, B) has one and only one solution, 
which must, therefore, be the unique solution of (5’). Hence 

TuHeoREM III. The systems (A, B), (Ao, Bo) are respectively equivalent 
to the integral equations (5’), (7) whenever the homogeneous system (Ao, Bo) 
is incompatible; when (Ao, Bo) is compatible, they are equivalent to (5’), (7’) 
together with the auxiliary equations (3), (4) respectively. 

Coro.tuary. When the homogeneous system (Ao, Bo) is incompatible, the 
non-homogeneous system (A,B) has a unique solution, which is given by 


(9) u(x, s) +f Pod, 


where Q(.%) ts the resolvent function of the kernel K(.:) of the equations 
(5’), (7’). When the homogeneous system is compatible, the non-homogeneous 
system (A, B) possesses solutions if and only if 


(10) 2) F(z, = 0 


for all solutions $; (x, s) of the equation 


(11) o(2, 8) = foc 


~ * Bocher: An Introduction to the Study of Integral Equations, p. 45. 
{ Here the quantity y is regarded as a constant, but arbitrary. 
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4. INTEGRO-LINEAR INDEPENDENCE 


A linear integral expression 
U[u;s] = a(s)u(s) + B(s) u2(s) 


(1) 
+f [A(s,r)u(r) + B(s,7r)ue(r)]dr 


is said to be integro-linearly self-dependent in the interval J, (or simply self- 
dependent) if there exists a continuous function ¢(s) in J,, not identically 
zero, such that 

(2) e(s)U[u;s|]ds =0 
for every pair of continuous functions u;(s), w2(s); otherwise, it is said 
to be self-independent. Two linear integral expressions U,, U2 of the type (1) 
are said to be integro-linearly dependent (or simply dependent) if there exist 
continuous functions ¢;(s), ¢:(s), not both identically zero, such that 


(3) Us[u; 2] s])ds = 0 


for every pair of continuous functions u;(s) and u2(s); otherwise they are 
said to be independent. 

The above definitions of independence and self-independence are obviously 
a generalization of the notion of linear independence for a system of algebraic 
expressions. We shall derive some necessary and sufficient conditions for 
such dependence. 

THeoREM I. A necessary and sufficient condition that the expression (1) be 
self-dependent is that the equations 


a(s)e(s) +f e(r)A(r, ar 0, 


(4) 
B(s)e(s) + e(r)B(r,s)dr =0 


have a non-trivial solution c(s) in common. This function c(s) then satis- 
fies (2), and conversely every function c(s) which satisfies (2) also satisfies (4). 

The theorem is an immediate result of (2) when we observe that w(s) 
and w2(s) are arbitrary functions. 

CoROLLARY. A sufficient condition that U[u;s] be self-independent is that 
either one of the equations (4) possess no non-trivial solution. 

THEeorEM II. When the U of formula (1), § 3, is self-dependent, the homo- 
geneous system (Ao, Bo) of § 3, subject to the condition (C) throughout J, as 
considered, has always a non-trivial solution. 
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By hypothesis there exists a continuous function c(s), not identically 
zero in J,, which satisfies (2) and forms a common solution of the equations 
(4). Multiplying (4) by R(%*) and R()*) respectively and adding the 
results together, we have, by (6), § 3, 


+f [lar =0, 


and this, by (8), § 3 and by (2) reduces to 


g(y,#)e(s) =f 


As a solution of (7’), § 3, we have, then, 
u(y,%) g(y,s)e(s). 


Consequently the system (Ao, Bo) has a non-trivial solution. 

TuHeEoreEM III. If the homogeneous system (Ao, Bo), subject to the condition 
(C), is compatible and if the expression U is self-independent, not every semi- 
homogeneous system 
(A, Bo) Llu] =A(a,38), U[u] =0 
possesses a solution. 

Suppose the system (4, Bo) does possess a solution for every (2,8). 
Then, by the Corollary to Theorem III, § 3, 


B 
oi(y,8)F(y,s)ds =0 


for every solution of the equation (11), §3. This equation reduces to 


B 
8) u| = 0 
a 9(y,8) y 
because of (8), §3. Expanding U and substituting for w(j*) its value from 
(13), § 2, we find 


where 
8) oily, 7) 
g(y,8) a 9(Y,T) 


ies gi (y, 8) gi(y, 7) 


= a(s) A(r,s)dr, 
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We now find the following equations by assuming first that \(£,s#) is 
zero when = y while it is still arbitrary when £ < y, and secondly that it is 
zero when £ S y and arbitrary when & > y: 


as at 
R(¢ +f td =0 (a=tSy), 


B 
R( +f acy, tae =0 (ySéSb). 


Letting £ = a in the first equation and £ = b in the second, we obtain 
=0, =0. 


Consequently [¢;(y, s)]/[g(y,8)] is a solution of (4) which does not van- 
ish identically since ¢; may be assumed not to be identically zero. There- 
fore, by Theorem I, U is self-dependent, which is contrary to hypothesis. 
THEOREM IV. A necessary condition that two linear expressions U,, U2 of 
the type (1) be independent is that each expression be self-independent. 
THEOREM V. A necessary and sufficient condition that the self-independent 
expressions U;, U2 be dependent on one another is that the equations 


B 
=0, 
B 
Br (#)ex(s) + + f 
=0 


(6) 


possess a common non-trivial solution c,(s), c2(s). These functions c(s), 
¢2(s) then satisfy (3); and conversely every pair of functions c,(s), ¢2(s) 
which satisfy (3) also satisfy (6). 

These theorems follow immediately from the definitions of dependence 
and independence so that no proof will be needed. 

Let us now consider, more in detail, the case in which, for every value of 
sinZd,, 
(D) A(s) = ai(8) a2(s) | 


|Bi(s) Be(s) | 


Equations (6) may now be reduced to 


+0. 


(7) =f [Ka(s,t)ei:(t) + (i=1,2), 
where 


(8) Ki(s,t) = 


(— 1)"| as(s) A;3_;(t, 
A(s) |Bi(s) Bs;(t,s) 
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We shall call the set of functions 

(9) 

Ku(s, Koo (s, t) 


the kernel-system of (7). 
By the side of (7) we consider the associated non-homogeneous system 


(10) = + ther(t) + t)ea(t) lat (¢ = 1,2). 


We define with Fredholm two new intervals 
a Sesh: JP: a Seah, 
such that a; < $i = a2 < fo, 
Bi — a, = = 


Let J; denote the combined interval of J‘? and J, so that 


F(s)de+ f F(s)ds. 
ye) 


We have also four square regions J‘; (7, 7 = 1, 2) to consider, and J;, 
will be used to indicate the totality of all the squares. Then we will map 
our functions into the new intervals and regions in such a way that the func- 
tions K;; composing the kernel system each occupy one of the four squares. 
That is, we define 


K(s,t) = Kij(s -a: ta, t—aj+a) 
(11) f(s) =fil(s a) 
e(s) = ¢;(s —a; +a) Ji =1,2). 


According to this notation, the equations (11) have the form 


(12) = f(s) + [ K(s, 


This equation may be treated as an ordinary Fredholm equation by forming 

the Fredholm determinant, A, and first minor, A(s, ¢), of the kernel K (s, ¢) 

in the usual way. We shall call A the determinant of the kernel-system (9). 
If A + 0, the resolvent function 


Q(s, t) = 
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satisfies the relations 


Q(s,t) = K(s,t) + Q(s,0)K(o,t)de, 
(13) 
Q(s,t) =K(s,t) + [ K(s,0)Q(a, t)da; 


and the equation (12) has one and only one solution, 


(14) + f tf (t)de. 


Returning to the old coérdinates, we can define 
Qi (s,t) =Q(s (for JJ 


This is called the resolvent-system of the kernel-system K;;. Equations (13) 
become 


8 
Qis(s,t) = Ki(s, t) +f [Qia(s,0) Ki; (¢, t) 
+ Qi2(s,0) Koj (o,t)Jdo, 
(s,t) = Kij(s, t) +f t) 
+ do, 


(15) 


and the solution (14) takes the form 


(16) =fils) +f (Qals + Qala, 


We now easily infer the truth of the following lemmas: 

Lemma I. A necessary and sufficient condition that the system (10) possess a 
unique solution is that A + 0. If this condition is satisfied, the solution is given 
by formula (16); and, in particular, the trivial solution will be the only solution 
of the homogeneous system (7). 

Lemma II. When A = 0, the system (7) always possesses non-trivial solu- 
tions; and a necessary and sufficient condition that the system (10) have solutions 
is that the equation 


B 
(17) f [¥i(s) fils) + = 0 


be satisfied by every solution y (s) of the equation 


(18) vis) = f vce K(t, 


By combining Lemma I and Theorem V we have 


- | 
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TueoreM VI. Jf U,, U2 are self-independent and fulfill the condition (D) 
for every value of S in J,, then a necessary and sufficient condition that they be 
independent of each other is that the Fredholm determinant A of the kernel-system 
(9) be different from zero. 

TueoreM VII. If U,, U2 fulfill the condition (D), the equations 


(19) Ui=¢1(8), Us = ¢2(s), 


when regarded as equations in u,(s) and u2(s), possess a unique solution if 
and only if U,, Uz are independent. This solution is integro-linear in ¢, and ¢2. 
For, if we replace u; and uz by 


01(8) = a2(s)ui(s) + Bo(s)ue(s), 
v2() = a1(8)ui(s) + Bi(s) use(s), 


and let fi = ¢2, fo = $1, equations (19) become 


8 
v;(s) tf (r) Kii(r, 8) + v2 Koi(r, 8)]dr (¢=1,2), 


or simply 


v(s) =f(s)+ K(r, ar. 


This equation however has precisely the transposed kernel K (r,s), so that 
it has a unique solution when and only when A + 0. The second part of 
the theorem now follows readily. 

Corotitary I. Jf (D) is fulfilled, the homogeneous. equations = 0, 
U2 = 0 possess non-trivial solutions when and only when U,, U2 are dependent. 

II. Jf is such that a; (s), do not vanish together* and 
if U, = 0 admits no non-trivial solution, then every self-independent Uz which 
fulfills (D) is independent of 

An important application of this corollary is that for a given self-independent 
U, in which a; (s), 81(s) do not vanish together, if there can be found a U2 
such that (D) is satisfied and for which U2 = 0 admits no solution other than 
the trivial one, then U,, U2 are independent. Unfortunately, I have as yet 
been unable to determine whether such a U2 always exists. We shall have 
to leave this important general problem without giving a definite answer. 
Instead we shall only show the following fact which includes several important 
special cases in which we know U>2 can be found. 


“* Obviously ( D) cannot be fulfilled if a:(s), 8:(s) do vanish together. On the other 
hand, if a (s), 6: (s) do not vanish together, there always exist functions a2(s), B2(s) 
such that ( D) is fulfilled, for instance ag = — 61, B2 = am. 
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TueorEM VIII. [f for a given self-independent expression 


Ui [u] = a1 (8) ui(s) + Bi(s) use(s) 
B 
+f Pale) + ile, 


in which a,(8), B1(s) do not vanish together, there can be found constants ky, 
ke such that ky (8), ke By (8) do not vanish together and such that 


Ui [u] = ki ar (8) + ke Bi (8) ue (s) 
8 
+f [ky + ke Bi (8, r)ue(r)]dr =0 


admits no non-trivial solution, then it is possible to find a U2 such that (D) is 
fulfilled and that U,, U2 are independent. 

For, suppose k; , ke are both different from zero, then the theorem is obvious, 
because if we group the constants k,, k. with the unknown functions u;(s), 
us (s) respectively, then U; will have exactly the same form as U, so that 
they both can have no solution. By Corollary II, a U2 exists. 

If ko = 0, then we must have k; + 0, a: (s) + 0 for every value of s in J,, 
and, further, on dividing U; by k:, the equation 


8 
= a (8)u,(s) + A,(8,r)u,(r)dr =0 


has no non-trivial solution u;(s). Now if we define 


Us, = a1 (8) + Ai(s,7r)ue(r)dr, 


then U, = 0 will have no non-trivial solution w2(s), so that the equations 
U, = 0, U,. = 0 together will admit only the trivial solution w(s) = 0, 
u2(s) = 0. The same argument will enable us to construct a U2 for the 
case ky = 0. 


5. Tue Apjsornt INTEGRO-DIFFERENTIAL EXPRESSION 


Derinition. The integro-differential expressions 


9) (2, 8)u(z, 8) t) dt, 


B 
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are said to be adjoint to each other; the equations 
(Ao) L{u] =0, 
(Ay) =0 


are called adjoint equations. 
If we multiply L[u], M[v] respectively by v(2, s) and u(2, s), integrate 
with respect to s, and subtract the results, we find 


which may be called Lagrange’s Identity. Integrating again, with respect 
to 2, we have the Green’s theorem: 


oL[u] — uM [v] ] dsdz 


(2) 
ds. 


These relations hold for any continuous functions u(2,s) and v(as), pro- 
vided they have continuous first derivatives with respect to z. 
Let us write for convenience (Ao) in the form 


(Aj) — Mio] =) 0) + f V( tat =0. 


A dash above a function will be used here consistently to indicate the corre- 
sponding function of the adjoint equation. The solution of (A,) may then 
be written 


B_ 
(3) +f tat. 


There are important symmetrical relations between the functions 


n(2*),8(2*), R(2*), ana 5(7*). 
y yt y yt 


To obtain such relations, let us apply Green’s theorem to the solutions of 


(Ao) and (Aj). For such functions, u,v, Green’s theorem becomes 


B B 
(4) f = 8)v(a2, 8) ds 
for any pair of values 2, 22 in J,. 
Let x3, x, be respectively the points at which the initial functions of u, » 
Trans. Am. Math. Soc, 25 
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are to be assigned. Then, by (12), § 2, the solutions have the forms 


8 
u(x, 8) +f 4) u(as, tat, 


Substituting into (4) and regrouping the terms, we find 


(as, = (0, 


where 


Since the initial function u(23, 8) is arbitrary, we conclude that f(s) =0. 
Moreover, the initial function v (24, s) is also arbitrary, so that, by the lemma 
to be proved presently, we obtain the following identities: 


- 
+f 


These relations hold identically in T,,222,Js and respectively. 
In particular, if we let z = 2; = 23, y = %2 = a, we have* 


8 yt 
y x yt xs 
* The first relation (6), and also the first relation (7), may also be inferred from the defini- 
tion of R; see (2), § 2. 
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Letting x = 21, y = 2% = 23 = 2,, we have 


y y 
(7) 
x 
y yt y ys A ys yt 
A special case of interest is when L[ wu] is anti-self-adjoint, i. e., when 
L{u] = — M[u]. In this case, we must have 


(8) o(z,0)=0, 


Consequently, we have 


We will now prove the lemma which we have referred to, and which will be 
useful again later. 
Lemma. If h(s) and H(s,t) are continuous functions such that 


(10) b(s)o(2) + = 0 


for every continuous function @(s), then h(s) =O and H(s,t) =0. 
It is sufficient to show h(s 9) = 0 when a < 8 < B, because it will then 
follow from the continuity of h that h(s) = 0, and therefore H(st) = 0. 
Let so be any interior point of the interval J,. Let a particular function 
(8s) be defined as follows: 


0 for |s 
o(s) =41 for $ = %, 
continuous, positive and = 1 for |s —s| Se. 


Then from (10) 


h( so) +f H (8, t)o(t)dt = 0. 


By the first law of the mean, 
h( 80) + 2eH (80, = 0, 
where 8% +e. Let | H(s,t)| < M, then 
| h( so) | = 2e6(ti) | H (80, t1) | < 2eM; 


hence = 0. 
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6. A Mopirrep Form ror GREEN’s THEOREM* 
Let U;[u;s], U2[u; 8] be the two integro-linear forms 
U;[u; = a;(s)u(a,s) + Bi(s)u(b, 8) 
(1) 


B 


Regarding (1) as equations in u(a,s) and u(b,s), and U,[u;s], Us2[u;s] 
as known functions of s, it is seen (Theorem VII, § 4) that if the condition 
(D) is fulfilled, it is possible to solve for u(a,s), u(b, s) uniquely in terms 
of U, and U2, provided the forms are independent; and furthermore, that 
the unique solution will consist of two integro-linear forms in U; and U2 of 
the same form as (1). In this case, the second member of Green’s Theorem 


((2), § 5), in which we put 2; = a and 2 = b, thus becomes 


where 8], V2[v; 8] are integro-linear forms in v(a, s) and v(b, s) of 


the form 


Vi[v; 8] = ¥i(s)0(a, 8) + 6;(8)0(b, 8) 


(3) 


Thus we see that Green’s theorem may always be written in the form 


f 2) u(x,s)M[v]dsdz ) 
vale 


-f (Ui [u; V2[v; 8] + Us[u; 8] Vile; 8]) ds 
if U; and U2 are independent and satisfy condition (D). 

Now, suppose (J) is not satisfied, or that (D) is satisfied but U; and U2 
are dependent. Will it be still possible to determine V; and V2 so that the 
identity (2) will hold? Let us find the conditions under which V;, V2 can 
be determined so as to satisfy (2). 

Assuming that U,, U2 have the form (1) and V;, V2 the form (3), let us 
then determine the continuous functions y;, 6;, C;, D; (4 = 1, 2) so that 


*In connection with §§6, 7 see the corresponding developments for differential equa- 
tions given in these Transactions by Birkhoff, vol. 9 (1908), p. 373, and Bécher, 
vol. 14 (1913), p. 415. 
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(2) holds for every set of continuous functions u(a,s), u(b,s), v(a,s), 
v(b,s). It may be remarked here that the notations U;[u;s], U;[u], 
U;(s), U; will be used indiscriminately for convenience, the same being 
true for V;. 

On substituting (1) in (2) and equating the coefficients of the arbitrary 
functions u(a,s) and u(b,s), we obtain 


ax(s) Valo; 9] + ax(#) Vales] (Vales) 


+ Ve[v;r] Ai(7, 8) ) dr = —v(a,s), 
(5) 8 
Ba(s) Vale; + + (Vi[v; r] Bo(r, 8) 


+ V2[v;r] Bi(r, 8) )dr = 8) 


as a necessary and sufficient condition that U,, U2 defined by (1) should 
satisfy (2). Substituting in these equations the expressions for V; and V2 
from (3) and gollecting the coefficients of the arbitrary functions v(a, s) 
and v(b, s), we find, by the lemma proved at the end of § 5, that the following 
identities give a necessary and sufficient condition for V;, V2 as defined by (3) 
to satisfy (5): 


ao (8) ¥i(s) + ar(s)y2(s) +1=0, 
Bo(s)¥i(s) + Bi(s)¥2(s) = 90, 
ae + a1(s)62(s) = 0, 


(6a) 


(6b) 
B2(s)61(8) + Bi(s)62(s) —1=0, 


a2 (s)Ci(s,7r) + + 71(7) 8) 
+ ¥2(7r) Ai(7, 


B 
+ (Aa(t, C(t, + #) 7)) dt = 0, 


Bo(s)Ci(s,r) + Bi(s)C2(s, 7) + Bo(1, 8) 


+ v2(r) Bi(r, 


(Ba(t, 9) + Br (t, #)C2(t, r)) dt = 0, 


( ‘ a) 
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+ a1(8)Do(s,r) + 6:(r) Ao(r, 8) 
+ 62(r) Ai(r, 8) 


8B 
+f (Ao (t, 8) Di(t, r) + A;(t, s)De(t, r))dt = 0, 


7b 
B2(s)Di(s,7r) + Bi(s)Do(s,7r) + 6: (7) Bo(r, 8) 


+ Bi(r, 8) 
8 
+f (Bo (t, 8) Di(t, r) + By (t, s)De(t, r))dt 


Thus these eight equations form a necessary and sufficient condition that U,, 
Us, Vi, V2 as defined by (1) and (3) satisfy (2). We will now inquire under 
what conditions the continuous functions y;, 5;, C;, D; (4 = 1, 2) can be 
determined to satisfy equations (6a), (6b), (7a), (7b). 

If A(s) + 0 for every value of s in J,, there will be a unique solution of 
equations (6a) and (6b), namely 


On the other hand, if for a particular value, s9, we have A(s 9) = 0, then, 
in order that the matrix and the augmented matrix of the system (6a) have 
the same rank, we must have 6;(s 9) = B2(s)) = 0. But this cannot be 
the case, as we see from the second equation (6b). Consequently, y;, 6; 
cannot be determined when the condition (D ) is not fulfilled. The condition 
(D) is then a first necessary condition that we have to impose on U,, U2 
in order that the problem in question be possible. 

Assuming then that (D) is satisfied by U, and U2, let us now consider 
the system (7a). 

Using the notation (8), § 4 and letting 


_| mr) Ke(s,r) 
Filet) =| _ 


di(r) Ke(s,r) 
gi(s,r) = —dbe(r) Ka(s,r)’ 


(9) 


the equations (7a) may readily be reduced to the form 


B 
(10) Ci (8,7) = f;(s, [Ka(s, t)Ci(t,r) 
+ (i= 


In this form we have precisely a system of equations of the type (10), § 4. 


384 
= 
1,2). 
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Now if the Fredholm determinant A of the kernel system K;;(s,¢), (i,j = 1, 
2), is different from zero, we have by Lemma I, § 4, a unique solution of the 
equations, which is given by 


8 
Ci(s, r) =fi(s, r) +f [Qa (s, t)fi(t, r) + Qz(s, t)fo(t, r) 

(¢=1,2). 

Because of (9) and the resolvent relations (15), § 4, this solution simplifies into 


(s, 
— Qa(s,r) 


and because of (8) it further reduces to 


1 |Bi(r) Qe(s,r)) 
(11) C;(s,r) = Bo(r) Qa(s_r)| 


A(r) 
Similarly, for the system (7b) we have the unique solution 


1 jai(r) Qie(s,r)| 


On the other hand, if A = 0, solutions of (7a), (7b) both exist by Lemma II, 
§ 4, if and only if 


C:(s, 17) -| 


B 


B 


for every non-trivial solution, (s), of the equations 


B 
(13) = [va(t)Ku(t, #) + (= 


Suppose both of these conditions are satisfied. Substituting the values of 
fi, fo, 915 92 from (9) and 71, 72, 61, 52 from (8), we have 


[¥i(s) Ku(s,7) + ¥2(s) Kai(s, r)] ds 


9 
[yi(s) Kio(s, + Yo(s) Koo(s, r)] ds 


a, 


B 
[¥i(s) Kiu(s,r) + ~o(s) Koi (s, r) J] ds 


ae (rT) 


Kuls, r) + ¥2(s) Koo(s, ds 


(i= 
(i = 
2). 
(i=1, 
(12) 
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These equations may now be regarded as a system of linear algebraic equations 
whose determinant, A(r), does not vanish for any value of rin J;. Whence 


Kus, r) =0 (i=1,2), 


i. e., Vi = 0, Yo = O because of (13). But this is contrary to the fact that 
¥i(s), ¥2(s) are a non-trivial solution of (12). Hence C;, D; cannot be 
determined when A = 0. Thus we have A + 0 as a second necessary con- 
dition to be imposed on U,, U2; that is (§ 4, Theorem VI), U;, U2 must be 
independent in addition to fulfilling the condition (D). Hence 

THEOREM I. A necessary and sufficient condition that the expressions V;, V2 
of the type (3) be determinable so that the identity (2) holds for every set of con- 
tinuous functions u(a,s), u(b,s), v(a,s), v(b, 8), tw that U,, Use fulfill 
condition (D) and that they be independent. The determination is unique and 
given by formulas (3), (8), (11), (12). 

Now let us suppose that we start from the expressions V;, V2 just deter- 
mined and that we try to determine U,, U2 so as to satisfy (3). We form the 
determinants A(s), A for the expressions V;, V2 corresponding to the deter- 
minants A(s), A for U,, Uz, and denote by (D) the condition that, for 
every value of s in J,,A(s) +0. Then, by the theorem just stated, since 
U,, U2 do exist, we have the 

Corotiary I. Jf U;, Us are independent and fulfill the condition (D), 
then the expressions V1, V2 are also independent and fulfill the condition (D) : 
"Thus in this case the two sets of expressions are uniquely determinable from 
each other. 

We see that the necessary and sufficient condition of Theorem I is precisely 
a necessary and sufficient condition that the system U; = 0, U2, = 0, admit 
no non-trivial solution (Theorem VII, Corollary I, § 4). Hence 

Corotuary II. Jf Ui, Us are such that the system U,; =0, U2 =0, 
admits no non-trivial solution, then V,, V2 can be determined and they are such 
that the system V,; = 0, V2 = 0, admits no non-trivial solution. 

The following fact will be useful later. 

Coroutuary III. If Vi, Ve exist, then u(a,s), u(b, s) can be uniquely 
expressed in terms of U, and U2 in the form 


u(a,s) = — y2(s)Ui (8s) — (8) U2(8) 


B 


(14) 
u(b,s) = 62(s)Ui(s) + 6:(8) U2(s) 


8 
+f [Ui (r)De(r, 8) + U2(r)Di(r, s)]dr. 
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The existence of a unique solution follows from Theorem VII, § 4, and 
there it is also shown that the solution is integro-linear in U; and U,. Thus 
we need now only to verify the formule (14). For this purpose we assume 


B 
7) + (9, 7) U(r) 
u(b, 8) = + Us(s) 


B 
+ [Bi(s, 7) + 7) 


Substituting in the expression 


2) 8) —u(a,s)v(a,s)]ds 


and collecting the coefficients of U; and U2, we find 


2) 0,8) — u(a,s)v(a,s)]ds 


where 


Vi(s) = — aj (s)v(a,s) + Bi (s)0(b, 8) 


(16) 
— [v(a,r)Ai(r, 8) — 0(b, r) Bi (r, 8)] dr. 


But (15) is exactly the identity (2). Since we have seen that for each given 
set of U, and U2, the expressions V;, V2 are uniquely determined and are 
given by (3), the expressions (16) and (13) are identical. Hence we have 
the formulas (14). 


7. THe ADJOINT SysTEM 


It has been seen that the expressions V;, V2 are uniquely determined for 
each U, integro-linearly independent of U, and fulfilling (D). Now let U; 
be another expression independent of U; and let V;, V: be the corresponding 
expressions thereby determined. We are to see how the two sets of V; are 
related to one another. 

The two sets of expressions, U;, U2, Vi, V2, and U1, U:, Vi, V2, satisfy 
the identity (2) of §6. Consequently, 


91 + Us[u; s]Vilv; 8]) ds 


8 
= (Ui[u; 8] Vi[2; 8] + Uz[u; 8] 8])ds. 


| 


388 MINFU TAH HU [October 


Let U;, U2, U2 be written in their full form and the coefficients of the arbitrary 
functions u(a,s),u(b, 8) be equated to zero; we obtain 


a1 (s)V2[0; 8] + a2(s) 8] 


+f (Ai(r, 8) 17] + Ao(r, 8) Vilv; r]) dr 


a 


[0; 8] + a2 (8) Vi[2; 8] 
B 
+ (Ai(r, 9) Vales + (r, 8) de, 
Bi (8) Ve2[v; 8] + Be(s)Vilr; 8] 
+f ( Bi(r, s) + 
= B:1(s)V2[0; 8] + B2(s8) 8] 
+ (Bir, 9) Vales #) Miles dr. 


Let us denote by ®[ V’; s], ¥[ V’’; s] respectively the expressions on the right 
of these equations. Since A(s) + 0, we find 


Vi = F;[V’; 8] +f (Ka(s, r)Vilo;r] + Ke(s, r)V2[v;r]) dr 
(¢=1,2), 
where 
(—1)las(s) 
F,[V’;2] = 
[Bi(s) 8] 
The expressions F,, F2 are integro-linear and homogeneous in V; and V3. 


As A + 0, these equations may be solved for V; and V2, and the solution 
is unique, having the form 


8 

Vi[o; = F;[V’; 2] +f (Qia(s,7r) Fil V’; r] + 7) r])dr 

(¢ = 1,2). 
The expressions in the second member are obviously integro-linear and homo- 
geneous in V; and V;, so that these equations may be regarded as an integro- 
linear transformation between the expressions Vi, V2 and V;, V;. Upon 
simplifications due to the resolvent relations (15) of § 4, these equations take 
the fina! form 


B 
Vilv;s] = Mi (s)Vi[2; 8] +f 
(2) 


9] = + + Vi los rldr. 
a 4 
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Similar equations may be obtained for expressing V;, V; in terms of V1, V2 
by solving (1) for V;, V2. It is important to notice that both of these integro- 
linear transformations are unique, since all the coefficients depend only on 
the coefficients of U;, U2, and U:. 

The importance of the equations (2) lies in the fact that V; is integro- 
linear and homogeneous in V;, so that whenever the boundary condition 
V; = 0 is satisfied, the condition V, = 0 is also satisfied, and vice versa. 
For this reason, we may state: 

THEOREM I. The condition V; = 0 is essentially determined by the condi- 
tion U; = 0, and conversely. 

DeriniTion. A pair of boundary conditions U; = 0, V; = 0 are said to 
be adjoint to each other if U;, V; satisfy a relation of the form (2), § 6, where 
U2 is independent of U; and the condition A(s) + 0 is fulfilled. The systems 


(Ao) L[u] =0, (Bo) Ui{u] =0, 
(Ao) M[v] =0, (Bo) Vilv] = 0 


are called adjoint systems. 

It follows from Theorem VIII, § 4, that an adjoint boundary condition 
always exists if the function a;(s) does not vanish in J, and the Fredholm 
determinant of [ — A;(s,7)]/[a:(s)] is not zero; and also under certain 
more general conditions there specified. 

As we have done in § 3, we will restrict ourselves to the case in which the 
system (Ao, Bo) is subject to the condition 


b 
(C) ar(s) + +0. 
If we consider the adjoint system (Ao, Bo) , we find that a similar condition 
(6) #0 


is fulfilled. For, from the formule (8), § 6 and (6), (7), § 5, we have 


“acon (*) 


Hence 
TueoreM II. [f the system (Ao, Bo) fulfills the condition (C), then the 
adjoint system (Ao, Bo) fulfills a similar condition (C). 
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We shall next prove a 

TuHeorEM III. The adjoint systems (Ao, Bo), (Ao, Bo), subject to the 
condition (C), have the same index. 

Let n be the index of the system (Ao, Bo) and m that of the adjoint system 
(Ao, By) . Let wm, +++, Up and 2%}, «++, be respectively complete sets of 
linearly independent solutions of the systems. Let wu be any solution of the 
equation (Ag), and v any solution of (Ao). 

Applying Green’s theorem to u and v;, we have 


8 
(3) U,[u; s] s]ds = 0 


for all solutions u of (Ag). As before, let y be any fixed value of x at which 
the initial function u(y, s) is assigned. Then, by formula (7), § 3, 


B 


where g(y,s) and G(,%) are given by (6), § 3. Because (3) has to hold 
for all continuous functions u(y, s), we have 


g(y, 8) 8] +f 7) ar 


B 
g(y, 8) v3 8] ar, 
That is to say, ¢:(y,s) = g(y,8)Velv;s], (¢ =1,2,---,m), are solu- 
tions of the equation 


On the other hand, the initial functions of u,, ---, uw, form a complete set of 
linearly independent solutions of the equation 


(5) =f K( u(y 


adjoint to (4). Hence if it can be shown (as we will now do) that the func- 
tions V2[v;;s] (¢ = 1,2, +--+, m) are linearly independent, then ¢;(y, s) 
will constitute m linearly independent solutions of (4), and therefore m = n. 
For, suppose V2[v;;s] (¢ = 1,2, «++, m) were linearly dependent. Then 
there would exist constants ¢,, ---, Cm, not all zero, such that 


Let us define 
Vo = C10 + +++ +m Um. 


or 
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Then 7% is also a non-trivial solution of the system (Ao, Bo), and satisfies 
Vo[0; 8] = 8] +--+ 8] = 0. 


But this is contradictory, because V; = 0, V2 = 0*admit no non-trivial 
solution. This completes the proof that m =n. 

In the same manner, the functions ¥;(y,s) = g(y, 8s) Use[ui;s] (¢ =1,2, 
--+, mn) form n linearly independent solutions of the equation 


where K has the same meaning in the adjoint system (Ao, Bo) as K has in 
the original system (Ao, By). On the other hand, the initial functions of 
01, °**, Um form a complete set of linearly independent solutions of the equation 


(7) = ff na 


adjoint to (6). Hence nm =m. When combined with the previous result, 
we have m = n. Thus we have established Theorem III and also 

THEOREM IV. Let uw, «++, Un be a complete set of linearly independent 
solutions of the system (Ao, Bo), and %1, +++, Un @ complete set of linearly inde- 
pendent solutions of the adjoint system (Ao, Bo). Then the functions 


(8) oi(y,8) = 9(y,8)Ve2[ 8] (4 = 1,2, +--,n) 


form a complete set of linearly independent solutions of the equation (4), and the 
functions 
(9) ¥il(y,s) =g(y, 8) 8] 
form a complete set of linearly independent solutions of the equation (6). 
If we replace ¢;(y, 8) in the equation (10) of §3 by the values (8), we 
obtain from the second part of the Corollary of Theorem III, § 3, the 
THEOREM V. A necessary and sufficient condition that a non-homogeneous 
system (A, B), subject to the condition (C) , possess a solution when the reduced 
system (Ao, Bo) is compatible and when the adjoint system (Ao, Bo) exists, 
is that 


(10) 8) Valve = 0 


for every v; which satisfies the adjoint system (Ao, Bo). 


By means of (8), § 3, this condition may be given the form 
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8. Tue BouNDARY CONDITIONS 


We have shown that two different choices of the auxiliary boundary ex- 
pression U, independent of the given U; and fulfilling the condition (D) 
lead to two expressions V; which are connected by an integro-linear trans- 
formation. Furthermore, this transformation is unique in both ways. This 
fact is important for us here, because in seeking the conditions that a given 
expression U, be self-adjoint, it is sufficient to seek the conditions that a par- 
ticular V, thereby determined be connected with U; by an integro-linear 
transformation.* It is clear that if one particular V; is integro-linearly con- 
nected with U,, then every V; will be so connected. 

Suppose the condition U; = 0 is self-adjoint, and that, for a particular 
choice of U., we have 


B 
(1) Vi[u;s] = M(s)Ui[u;s] + N(s,t)U,[u; t] dt, 
where 
U,[u; 8s] = a (s)u(a,s) + Bi(s)u(b, s) 
B 
+ (Als, r)u(a,r) + r)u(b, 
Vilu;s] = yi(s)u(a,s) + 61(8)u(b, s) 
8 
+f r)u(a,r) + r)u(b, r) dr, 
the functions y;, 6:, Ci, D; having the values given by the equations (8), 


(11), and (12) of §6. The equation (1) may be thrown into the following 
form 


Vilu;s] = M(s)ai(s)u(a,s) + M(s)Bi(s)u(b, 8) 


N(s,t)Ar(t, r)dt]u(a, r)dr 


8 
+f [M(s)Bi(s,r) + N(s,7r)Bi(r) 


N(s,t)Bi(t,r)dt]u(b,r)dr, 
whereby we obtain ° 


(2) vi(s) = M(s)ai(s), 


- * Professor D. Jackson takes the same point of view in his article, in these Transac- 
tions, vol. 17 (1916), pp. 418-424. 
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(2’) = M(s)Bi(s), 
B 
(3) Ci(s, r) = M(s)Ai(s,7r) + N(s8, r)ai(r) +f N(s,t)Ai(t, r)dt, 


(3’) Di(s,r) = M(s)By(s,r) + N(s,1r)Bi(r) +| N(s, t)By(t, r)dt. 


Equations (2) and (3) may be regarded as determining the functions M 
and N; equations (2’) and (3’) then constitute the conditions which must be 
imposed on U;, in order that it be self-adjoint. 

Substituting the values from (8), § 6, for y: and 6,, the equations (2), (2’) 
become 


A(s)M(s)ai(s) — Bi(s) 0, 
a,(s) —-A(s)M(s)Bi(s) =0. 


Now a (s) and 6;(s) cannot vanish together since we must have A(s) + 0 
throughout J, in order that the adjoint expressions exist. Hence, we must 
have 
A(s)M(s) 1 
1 A(s)M(s) 


for every value of sin J. That is, 
1 
M(s) = 
It follows that 


(4) ai(s) = + Bi(s) +0 (throughout J,). 


Conversely, when (4) is satisfied, 
1 
Equation (4) is a first necessary condition. 

Assuming then that (4) is satisfied, let us proceed to consider the equations 
(3), (3’). These equations are Fredholm equations with the kernels 
— [Ai(t,7r)]/[ai(r)] and respectively. It is con- 
ceivable that the Fredholm determinant of either one of these kernels might 
be zero. We shall now show that in such case, no self-adjoint system is 
possible. 

Let us suppose the Fredholm determinant of the kernel — [Ai (¢, r)]/ 
[ a; (1)] to be zero, and a self-adjoint expression to exist so that the equation 
(3) has a solution. For this it is necessary that 


B Ci(s,1r) 1 A;(8,7) 
(9) J 6(r)| £A(s) ar(r) 
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for every non-trivial solution ¢(r) of the transposed equation 


Ay, 


The + signs correspond to those of (4). Because of (6), the condition (5) 
has the form 

®Ci(s,7r) 

+ A(s) ai(r) 


or, since 


(7) 
Let us put 


Then equations (6) and (7) become 
U,[u;s] = 0, Vi[u;s] = 0. 


The identity (2), § 6 now becomes, if we let u(2,s) =v(2z,38), 


o(s) 
~{ Ral 

whence ¢(s) =0. But this is contrary to the fact that ¢@(s) is a non- 
trivial solution of (6). Thus we have derived a second necessary condition 
for the existence of self-adjoint expressions, namely, that both* — [ A; (t, r)]/ 
[ai(r)] and have non-vanishing Fredholm deter- 
minants. This condition is sufficient to insure the existence of a unique 
solution for each of the equations (3), (3’), and we shall have a third necessary 
condition upon equating these solutions to each other. It is also clear that 
these three necessary conditions combined are also sufficient for the existence 
of self-adjoint expressions. 

To determine the explicit form of the third condition, it is convenient to 
choose a particular U2 which will simplify the computation. We shall choose 
for instance U2 such that 


a2(s)=0, Pr(s)=1, A2(s,r) = B2(s,r) =0. 


This Us is integro- linearly independent of the given U,. To prove this we 


 * The proof for — [B,(t,r)]/[8:(17)] proceeds exactly in the same way. 


(s) _ Bi(s) + ai(s) 

* 

u(a,#) u(b,s) =0. 
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note that A(s) = a:(s) + 0inJ,; and according to the notations of § 4 
Kiu(s, r) = Koi(s, r) 0, 


(throughout J,,). 


a ($) 
Consequently K (s, r) = 0 whenever the second argument r is in the interval 
J, Let us write for short 
A,(r,8) Bi(r, 8) 

ai(s) ’ Bi(s) ’ 
and denote their resolvent functions, w rhich by hypothesis exist, by A’(s, r), 
B’(s,r). Then 


This is different from zero, because it is precisely the Fredholm determinant 
of the kernel A(s,7r). This completes the proof that U2 is integro-linearly 
independent of U; by Theorem VI, § 4. 

From the resolvent relations (15), § 4, we have also the following further 
facts: 


A(s,r) = B(s,r)=- 


Qu(s,7r) = = 0, 
B 
Qo2 (8, r) = Kx (s, r) +f (8, t) Koo (t, r) dt, 


B 
Kale, +) +f (8, t) (t, r) dt, 


8 
= Kw(s,7r) +f Ki2 (8, t) Qoo (t, r) dt, 


whence Q.2.(s,7r) = A’(s,r). Furthermore, 
(s r ) 
ai(r) ’ 


If we let ai (s) = + Bi(s) = 1, as we may do without loss of generality, 
we have 


A(s) =1 A(s,r) = — Ai(7’,38), B(s,r) = + B(r,3), 
Ki2(s, 7) = + B(s,r) A(e, 7), 


Ci(s,r)=- Di(s,r) =0. 


— Ci(s,r) = Qi(s,r) = F A’(s,r) + B(s,7r) +f B(s,t)A’(t,r)dt, 


Trans. Am. Math. Soc. 26 
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and the equations (3), (3’) have the following form 
B 
N(s,r) = Ci(s,r) + A(r,38) +f A(r,t)N(s,t)dt, 


8 
N(s,r) = + +f B(r,t)N(s,t)dt. 
Solving, 
8 
N(s,r) £A(r,9)] +f ACE, 


8 
+ Ee r)+A’(r,3) +f t)dt | BOs, r) 


+ B(s,0)| + A’(r,o) 
+f tat |ao, 
N(s,r) = + @(r,s) B(r, = + B’ (r,s). 
Equating and transposing, 


Exe r)+A’(r,s) + | = B(s,r) + (1,3) 


B B 
+f B(s,0)[ + +f t)at | do: 


Upon solving and simplifying, we obtain the last condition in the final form 


B 
A’(s,r) +A’ (r,s) + A’(s,0)A’(r,0)do 
(8) 


8 
= B’(s,r) + B’ (r,s) + 
Ja j 


THeorEeM I. Every self-adjoint integro-linear boundary condition may be 
reduced to the form 
U[u;s] =u(a,s) +u(b,s) r)A(r,s) 
+u(b,r)B(r,s)|dr=0, 
in which the Fredholm determinants of A and B® are not zero and their resolvent 


functions A’ and ®’ satisfy the relation (8). Conversely, every condition of 
this form is self-adjoint, provided, of course, that the condition (C) 


+0 


a 


is fulfilled. 


ae 
& 
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CoroLuary. When the integro-differential expression L | u] is anti-self-adjoint 
and the boundary condition U,;[u] = 0 is self-adjoint, the latter must have the 
form 


B 
= u(a,s) +u(b,9) — f (ula, 8) 


For, when L[ x] is anti-self-adjoint, R({*) = 1; so the condition (C) will 
not be satisfied when a(s) = — B(s) =1. 


9. THE GREEN’s FUNCTIONS 


In the theory of linear differential equations the conception of the Green’s 
functions enables us to write down in an explicit form the solution of a semi- 
homogeneous boundary problem consisting of a single linear differential equa- 
tion of the nth order, or a system of n linear differential equations of the first 
order, and of a system of n homogeneous linear boundary equations, whenever 
the reduced system is incompatible.* 

Following out this analogy, we are led to try to find a solution of a system 


(A, Bo) L[{u] =X(2,8), U[u] =0 


in the form 


(1) u(2,s) -{ ff acy, 


where H and G are independent of \. These two functions we shall call the 
system of Green’s functions for (4, By). We may arrive at such functions 
by imposing certain conditions of discontinuity suggested by the discon- 
tinuities of Green’s functions for differential equations. 

Let G (5%) be continuous, together with its first partial derivative with 
respect to 2, throughout the region Jz, let H be continuous, to- 
gether with its first partial derivative with respect to x, throughout each of 
the following regions: 


finally let H possess a discontinuity when x = y of the type 


* * Birkhoff, these Transactions, vol. 9 (1908), p.377; Bounitzky, Liouville’s 
Journal, ser. 6, vol. 5 (1909), p. 65; Bécher, Annals of Mathematics, 
ser. 2, vol. 13 (1911-12), p. 71. 


“89 
ac 
ve 
i 
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We use the notation H to mean lim H, € > 0; also 
e-0 
8 x 
) = tim ). 
yo tte 
Because of the continuity throughout 7; and 72, it is clear that 


‘yx+,8\ fy 8 
5 ). 


Replacing u by its value from (1), we find 


Liu} prey 
+f ae’) 


+4(z,0)H(**) d(y, 8) dy 


Hence, on account of (2), u as given by (1) satisfies (A ) for every continuous 
function (2, s) if and only if H (5°) and G (}{) respectively satisfy the 
equations 


(3) 


Both of these equations have to be considered separately in the regions 7; 
and 7;, although the formal work is the same. 


We may now regard the functions 


y y yt 


as the initial functions given at a fixed point y in the interval J,. These 
functions will at present be assumed to be continuous in their respective 
variables, and to satisfy condition (2); otherwise they are arbitrary, pending 
further determination. 

By (2), § 2, the solution of (3) is 


y y y 


in which the + signs correspond respectively to the regions 7;, T2. 


Se 
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The equation (4) may be solved by the result of § 2 in the form 
B 
yt y yt a yo yt 
x8 8 Et (ze o Et | 
=f He )+J ) ac 


“By Corollary III, Theorem I, § 2, this simplifies into 
ve Et 
8 )a 
+f (33) y 


Replacing H (§‘) by its values from (5) and making use of the definition 
(10), § 2, this further simplifies into 


where the + signs again correspond to the regions 7, and 7. It is important 
to observe that the function G (*;) thus determined is continuous through- 
out I,,J5:, because the only possible place of discontinuity is when z = y, 
but then S (7) = 0 by virtue of its definition. 

We are now to determine H (/*'*) and G(%*) so that the expression (1) 
also satisfies the boundary equation (Bo) for all X(z,s). Upon substitu- 
tion of (1) in U[u] we have 


| +f 
| 


We shall have U[u] = 0 for all X (2, s) if and only if the equations 


(7) + =0, 


are satisfied. 
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On substituting in (7) for H({%‘) and H(}") their values from (5), we 
obtain an equation in H (¥*'*) and H (}~*), which together with (2) enables 
us to find for these functions the values 


g(y,8) y 8) 


since we confine ourselves to the case g(y,s) + 0 ($4). It is convenient 


at this stage to introduce the following abbreviations which will be useful 
later. 


b 
n(y,s) = a(sR(**), = R( 


P 8 
(10) a( + f 


Thus according to the notations (6), § 3, we have* 


(11) g(y,s) =gily,s) +g2(y,8), a( = 1) + 


yt yt 


and the equations (9) may be written 


Now from the equation (6) we have 


yt y yt 
y—,t as 
yt 
8 
a” yo yt y yt 


* Note that G (, ) and G i are two entirely different functions. 
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whence 


We shall now substitute this value in (8) and also replace H({‘), H(>') 
by their values from (5). If in the resulting equation we replace H (}*’*), 
H (4~**) by their values from (9’), we find that (8) reduces to 


or 


if we write for short 
8 


8 
(13) 2(y,t) a(t), 


The kernel K (,,‘) is the same as that in the equation (7’), § 3. 

Now if the homogeneous system (Ao, Bo) is incompatible, then the kernel 
K(,+) possesses a resolvent function Q(,;) and the equation (12) pos- 
sesses a unique solution given by 


DEFINITION. The functions H(}°), are said to form a system 
of Green’s functions of the integro-differential boundary problem (Ao, Bo): 
L{u] =0, U[u] = 0, where U[w] is assumed to be integro-linearly self- 
independent and subject to the condition (C), if they are defined respectively 
in the regions J,, J, and I,, J, and possess the following properties: 

1. H(;*) is continuous together with the first partial derivative with 
respect to x in the regions 7; and 72, and 


y y 
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2. G(;+) is continuous together with the first partial derivative with 
respect to x throughout the region J,, Js:. 

3. Throughout 7, and 7, the functions H({*) satisfies the equations 
(3) and (7). 

4. The function G ({;) satisfies the equations (4) and (8). 

TueoreM I. When Green’s functions exist, the semi-homogenecous system 
(A, Bo) possesses a solution given by the formula (1). 

We have seen in the above deduction that Green’s functions exist if the 
system (Ao, Bo) is incompatible. Because of the fact (Theorem III, § 4) 
that when (Ao, Bo) is compatible not every semi-homogeneous system (A, 
By) can have a solution, it follows that Green’s functions do not exist for this 
case. Hence 

THEOREM II. A necessary and sufficient condition that a system of Green’s 
functions exist for a system (Ao, Bo), in which U is self-independent and (C) 
is fulfilled, is that the system (Ao, Bo) be incompatible. When this condition 
is satisfied, the solution given by (1) is the unique solution. 

The last fact follows from the Corollary to Theorem III, §3. From the 
theorem just stated, it follows that the equation (12) cannot possess a solution 
whenever the system (Ao, Bo) is compatible. Thus we have the 

CoroLitary. When a system (Ao, Bo), in which U is self-independent and 
fulfills (C) , is compatible, the function (13) cannot vanish identically; and 


fs 
f 9) F(, 4) 


does not vanish identically for every ;(y, 8) which satisfies 


= K(,")ar. 


THeoREM III. For a system (A, Bo) there cannot exist more than one set 
of functions H(5*), G(j¢) such that (1) ts a solution of the system for every 
A(2,8); and if such a set exists, it consists of the Green’s functions for the 
system. 

When the reduced system (Ao, Bo) is compatible, no such functions H 
and G@ can exist, because in that case not every semi-homogeneous system 
(A, Bo) can have a solution (Theorem III, § 4). When (Ao, Bo) is incom- 
patible, Green’s functions exist and (1) is the unique solution of (A, Bo). 
Hence if there exists another set of functions, H’ and G’, such that 


b b 
u(x, 8) = eddy + (24) acy, 


is also a solution of (A, Bo), this solution must be identical with (1) and 


t 
| 
& 
| 
| 
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therefore the difference of this and (1) is identically zero. Since A(z, s) is 
arbitrary, we find, by using the lemma in § 5, H’ = H,G’=G. 

If in the system (A, Bo) we replace the boundary condition U = 0 by 
another boundary condition U’ = 0, where U’ is an integro-linear function 
of U, then (1) will be obviously also a solution of the resulting system; hence 

CoroLitary. The Green’s functions of a system are invariant of the choice 
of boundary conditions, provided the different choices of boundary expressions 
are integro-linearly connected. 

Another important property is that there exists a symmetrical relation 
between the Green’s functions of the given system and the adjoint system. 
From Corollary I, Theorem I, § 6, it follows that the adjoint boundary con- 
dition V = 0 is self-independent. By reference to Theorems II, III, § 7, 
we infer from Theorem II: 

THeorEM IV. If the system (Ao, Bo) possesses Green’s functions, H, G, 
the adjoint system ( ye By) possesses Green’s functions, H,G. 

The solution of the adjoint semi-homogeneous system 


(A, Bo) =0 


is given by 


b _ 
(15) o(z,0)= f +f f u(y, tatdy. 


Let u(x, s) be the solution of the system (A, Bo) given by (1). Then, by 
Green’s theorem, 


f + =0. 


On the substitution of the values of u and v from (1) and (15), we have 


which holds for every \ and w. Hence, by the lemma in § 5, 


THEoREM V. The Green’s functions of adjoint systems satisfy (16). 
THEOREM VI. If two systems 


L{u] =X(2,8), , =X(2,8), 


(A, Bo) Ui [ul] (A’, Bo) Ui [u] 


q 

cA 
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have the same Green’s functions H(;*), and if the adjoint system“ 
(Ao : Bo) exists, then the expressions L and L’ are identical and U; is an integro- 
linear function of U,. 

Since the Green’s functions are the same for both systems, the function 
formed from them by the formula (1) satisfies both systems, hence it satisfies 
the homogeneous equation 


— L’[u] =0, 
that is, 


if we let 


(x, 8) (x, 8) — (2,8), 


(ei): 


If we substitute (1) in this equation, we find 


By the lemma of § 5, we obtain 


(2, 0)H ) =o, 


(17) 


Let us take the limit of the first of these equations as y approaches z first 
from above and then from below. This gives 


(2, ) =o, 


and by subtracting one of these equations from the other, we see from (2) 
that @” =°%. 

Substituting this value in the second equation (17), and replacing H and G 
by their values — H, — G, we find 


d 
b B 
xs 8 ze 
+f 
a Wa y t t 
= : 
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Now the first member of this equation is, by (15), the solution of the system 


~ = Vi[v] =0, 


regarded as equations.in y andt. Hence y’’ = 0. This completes the proof 
that L[u] and L’[u] are identical. 

Our theorem will be proved if we can show that U; and U; are integro- 
linearly connected. For this purpose we substitute in U; for w(a,s) and 
u(b,s) their values from (14), §6. This gives 


B 
Ui (s) = Mi(s)U,(s) +f Ni (8, t) Ux (t) dt 


+ M2(s)U2(s) +f N2(s, t) U2(t) dt, 
in which 
M2 (8) = — a (8)71(8) + Bi(s)61(s), 
No(s, t) = — a (8)Ci(t, + Bi(s)Di(t, 8) — Ai(s, t) y(t) 


B 
+ Bi (s, t)&:(t) +f [— Ai(s, r)Ci(t, 7) 


+ Bi(s,r)D,(t,r))dr. 


And if we can show that Mz(s) = 0, No(s, t) = 0, we shall have established 
an integro-linear relation between U; and U;. 

To prove M2(s) = 0, we make use of (7) and the corresponding formula 
for (A’, B,). We have, since (A, Bo) and (A’, B)) have the same Green’s 


functions, 
b 
+ wt, 


ai + i @. 


Now for each constant value the functions °°) and °°) cannot 
both vanish identically, because otherwise we would have from (5) both 
H(%~*°) and H(%*:*°) identically zero, which is impossible owing to the dis- 
continuity of H. Consequently 


— (8) Bi(s) + Bi(s)ai(s) =0. 
Hence, from (8), § 6, we have Mz, = 0. 
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To show N2(s,t) = 0, we have from the formule corresponding to (8) in 
the case of the systems (A, By) and (A’, By) 


The subscript 1 has been dropped from U, and V, for convenience, and the 
variable subscripts are inserted to indicate the variables operated on. On 
account of the relation (16) we have the identity 


rel 


which may be written 


x 


bt 


, t 


Expanding and collecting terms, we find 


+ 


+ (5")+ a(°")| ar =0. 


By means of the relations 


¥ 

yt,8 y 8 

H ( = H 

y 
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and (7), (16), (4), the first member of this equation reduces precisely to the 


expression N2(s,¢). Thus our proof is completed. 
CoRoLtaRy. A necessary and sufficient condition that the Green’s functions 


of a system be skew-symmetric, 1. e., 


(33) 
y x yt xs 


is that the integro-differential expression L[u] be anti-self-adjoint and the 
boundary condition U[u] = 0 be self-adjoint. 

The sufficiency of this theorem follows from Theorem IV, and the necessity 
from the theorem just proved. 


HarvarD UNIVERSITY 
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ON SCALAR AND VECTOR COVARIANTS OF LINEAR ALGEBRAS* 


BY 


OLIVE C. HAZLETT 


INTRODUCTION 


1. Relation to the literature. This paper concerns itself with two rather 
different kinds of covariants of the general linear algebra, which might, for 
convenience, be distinguished by the adjectives scalar and vector. Consider 
the general linear algebrat E with the units e,, --- , e, and with the constants 
of multiplication yi;, (7, 7, k = 1, ---, m), where the general number of 
the algebra is X = Diz;e;. In a previous papert we defined a rational 
integral covariant C of the algebra E as a rational integral function of the y’s 
.and the x’s which possesses the invariantive property whenever the units are 
subjected to a linear transformation. The paper just mentioned uses such 
covariants to characterize linear associative algebras in two and three units. 
The present paper proves for such covariants several fundamental theorems 


analogous to the fundamental theorems in the theory of invariants for algebraic 
forms. Since these covariants are isobaric and homogeneous, we can apply 
here Hilbert’s proof of the “ finiteness ’’ of the number of covariants of alge- 


braic forms. 

We must not, however, content ourselves with the study of such covariants, 
since they are not sufficient to characterize linear algebras, both associative 
and non-associative, even when there are only two units. Accordingly, we 
consider rational integral functions of the y’s, the 2’s and also the units 
e; (1 =1, «++, m) which possess the invariantive property. Since these 
functions involve the e’s, we call them vector covariants in contradistinction to 
the functions C mentioned above, which we might call scalar covariants. At 
first sight, it might seem as if we were confronted with a rather inconvenient 
difficulty for the following reason. In a theory of such covariants we would 
expect to find treated certain differential operators analogous to the familiar 
annihilators 2 and O; but Scheffers has shown that, in a linear algebra, a 


* Part I was presented to the Society December 27, 1917, under a different title; Part I 
was presented February 23, 1918. 

t For the fundamental definitions in the theory of linear algebras, see Section 2. The most 
familiar linear algebras are number fields (in particular, the field of reals and the field of 
ordinary complex numbers) and quaternions. 

tAnnals of Mathematics, second series, vol. 16 (1914), pp. 1-6. 
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derivative is not uniquely determined unless multiplication is commutative, 
and we are concerned with algebras both commutative and non-commutative. 
This Gordian knot can, however, be readily cut by a device. We can, accord- 
ingly, derive the annihilators in the approved manner, and hence show that 
every rational integral vector covariant of the linear algebra is a covariant of 
the general number of the algebra X = 22; e; and a suitable set of scalar 
covariants of the algebra. From this fact flow theorems analogous to those 
proved for scalar covariants and, in particular, the “ finiteness’ of vector 
covariants. 

2. Definitions. A linear algebra E over the field F is a set of hypercomplex 
numbers of the form X = }°j_, 2;e;, where the codrdinates x; range inde- 
pendently over F. Here the wnits e; are such that e;e; = Dies vijnex 
(1,7 =1,--+,m), where (i) the constants of multiplication y;;, are in F, 
(ii) the sum of two numbers X and Y of the algebra is Y¥ + Y = 
Dia: (2; + yi) e; and (iii) numbers of the algebra combine under addition and 
multiplication according to the distributive law. Unless explicitly stated, 
we do not assume the commutative nor the associative law of multiplication.* 

If we have a number X = > 2;e; of an algebra whose codrdinates are 
(ordinary) complex numbers, then we can find a number Y = Dy;e; + 0 
and a scalar w such that XY = wY if and only if the right-hand characteristic 
determinant 6(w) =| — djxw| is zero. Here dj; =1, dj, = 0 
if k +7. Similarly, there is a number Y + 0 and a scalar w such that 
YX = oY if and only if the left-hand characteristic determinant 


(w) =| Dias ti — 


vanishes. 
For convenience in the study of covariants of a linear algebra, we introduce 
the notion of weight. If we subject the algebra E to the transformation 


€, = de, (A # 0) 
(A )r 

= (1#r), 
the y’s are subject to the induced transformation where y;;, is unaltered when 
t,j,kAr, vir (4,j #1) is multiplied by antl yirr (2 7) are un- 
altered, is multiplied by \, and yjrx (j,k r) by Nand (k ¥ 
by \?. Hence we shall say that (7,7, k # r) is of weight 0, yijr (7,7 #1) 
of weight — 1 in é,, Yrir and Yirr (2 ¥ 1) of weight 0, Yrrr of weight 1, y,;, and 
(j,k of weight 1 and (k of weight 2ine,. For every unit 
e, we thus assign to yi;, a weight with respect to that unit, which is counted 
in the following way: the first subscript, 7, contributes + 1 if 7 = r, other- 


*For definitions of a linear associative algebra by a set of independent postulates, see 
Dickson, these Transactions, vol. 4 (1903), pp. 21-26. 
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wise 0 ; the second, 7, contributes + 1 if 7 = r, otherwise 0 ; and the third, 
k, contributes — 1 if k = r, otherwise 0. The sum of these three partial 
weights is called the weight of y;;, with respect to e,. 

For a similar reason, we shall say that e, is of weight 1 in e, but of weight 0 
in any other unit e; (¢ # r). 

If X = 2; e; is the general number of the algebra then if ---, 2%, 
are the coérdinates of XY when it is expressed in terms of the new units—that 
is, X = D2; e;—then x, = 2,/A and x; = 2;(1#r). For this reason one 
might be tempted to say that z, is of weight — 1 with respect to e, and that the 
other x’s are of weight 0 with respect to e,. In the applications, however, 
it will be better so to define weight that every x has a weight which is positive 
or zero with respect to any unit e,. Accordingly, we shall agree that 2, is of 
weight 0 in e, and that 2; (4 # r) is of weight 1 in e,. 

We shall use the term total weight of a term for the sum of the weights of 
that term in all the units e,, ---, e,. Notice that the total weight of any 
y, x; or e; is 1, and thus the total weight of a product of a number of y’s, 2’s 
and e’s is the degree of that term. 

If P is the product of a number of y’s (or a number of e’s) such that it is of 
weight w, in a particular unit, e,, it is multiplied by \"” under the transforma- 
tion (A),. If, however, P is the product of a number of 2’s such that it is of 
degree s in the 2’s and of weight w, in e,, then under the transformation (A), 
it is multiplied by A“"~*. 

Any polynomial in the y’s, the 2’s and the e’s such that all terms have the 
same weight with respect to a particular unit e, will be said to be isobaric 
with respect to e,. If the terms have the same total weight in all the units, 
the polynomial will be said to be isobaric on the whole. If P is a polynomial in 
the y’s, the 2’s and the e’s which is isobaric with respect to e,, of weight w, 
in e, and which is homogeneous in the 2’s of degree s, it is multiplied by 
under the transformation (A),. 


Part I. Scatar CovaRIANTS 
3. Some fundamental theorems. The invariancy of J under the transfor- 


mation 


n 
T: = >, dimen 


m=1 


A |dim| 0, 


is expressed by the formula 
(1) I (vist) = (aim) I (vise) 


where the y;;, are the constants of multiplication of the original algebra and 


x 
wre 
<7 
ay 
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the yi;« are the constants of multiplication of the transformed algebra. 
Here has been written for I (iu, Yi12, > Yann), O(@im) for 
Gi2, and for Yu2, Yann). Hence by 
applying the special transformation e; = de; (A + 0; i = 1, ---, m), it fol- 
lows that J is homogeneous in the y’s. 

Under this transformation 7 the y;;, are subject to an induced transforma- 
tion such that 


If we keep 7 and j fixed in (2) and let g range from 1 to n, we have n non- 
homogeneous linear equations in the n unknowns y;;, which can be solved 
uniquely, since A #0. Hence each y;;, is a homogeneous function of degree 
n + 1 in the a’s divided by A. 

Accordingly, if I (yijx) is a rational integral invariant of total weight w 
in the units and hence homogeneous of degree w in the y’s, I(yi;x) is a 
rational integral homogeneous function of degree (m + 1) w in the aj» divided 
by A”. But, since I (yi; ) is independent of the a’s , this implies that ¢ ( aim) 
is a rational integral homogeneous function ¥(aim,) of degree (n + 1)w 
in the a’s divided by A”. That is, 


I (vise) = I 


for every transformation of determinant A #0. Hence under the inverse 


I (vir) = I (vir), 


transformation 


where A,,; is the cofactor of a,;. Combining these two results, we see that 


Am 
a) =1; 
or, since y is homogeneous of degree (n + 1) w, 
V (dim) (Amz) = 
But, since A is irreducible, this can be true only if Y (dim) = kA“, where 
k is some constant. Then ¥(Am:) = kA™, where A’ = | Ami| = A", and 
hence u = w/n. By applying the identical transformation, it is evident that 


k = 1, and thus 


(3) (dim) = 


This last formula can be proved in the above manner familiar in the classical 
invariant theory, or it can also be proved for associative algebras by the 


Trans. Am. Math. Soc. 27 
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following device. If X = }2;e; is a number of the general n-ary linear 
algebra E = (e€,, --+, én), then EX = E= XE. That is, if we multiply 
each unit e; of E by X , we obtain n linearly independent numbers of E which 
can be taken as a new set of units. This remark still applies if we restrict 
ourselves to the general associative algebra. Accordingly, replace e; by 
Xe; (i =1,---,n). Then 


= (Xe;) (Xe;) = Le Yim) (Xen) 


Thus I (¥;;x) is of degree 2w in the y’s. But A’ is of degree rn and I (yijx) 
is of degree w in these y’s. Hence 2w = rn + w,orr = w/n. 

Also, if w; is the weight of J in e;, then wy = we = --- =w,. This 
follows from equations (1) and (3) and the fact that J must be unaltered 
(except possibly for sign) when any two units, say e; and e;, are interchanged. 

Since w/n is the weight of J in any particular unit, say e,, w/n is a positive 
integer. Notice, moreover, that w is equal to the degree of J, since each 
vijx is of total weight 1 in the units. Thus we have 

TuHeoREM I. For an n-ary linear algebra E = (e,, p €n) with constants 
of multiplication yi;,, let I be a rational integral invariant of degree d. Then 
I is homogeneous and isobaric of total weight d in all the units, and d/n is a 
positive integer. If we subject the units of E to the transformation T of deter- 
minant A # 0, carrying E into the algebra E’ with constants of multiplication 
Vin» then 

I (Vin) = AU 

The covariancy of the rational integral function C under the transformation 

T is expressed by the formula 


C (Vins a,) = (aim) C 


Such a covariant is not necessarily homogeneous in the coérdinates x; of the 
general number of the algebra, but it is the sum of covariants C; such that 
each C; is homogeneous in the 2’s. Accordingly, we may, without loss of 
generality, restrict ourselves to the study of homogeneous covariants. Every 
covariant C which is homogeneous in the 2’s is also homogeneous in the y’s. 
If C is homogeneous of degree s in the z’s and of degree d in the y;;;, then 
C(Y¥in is a rational integral homogeneous function of degree 
(n +1)d+(n —1)s in the a’s, divided by A%*. Thus ¢(a,,) in this 
case is a rational integral homogeneous function, y, of the a’s of degree 
(n + 1)d + (n — 1)8 divided by A***, and consequently 


y ( Aal ) y ) = A (n+1)d+ (n—1)8 


Thence it follows that 
¢(am) = ATOM, 
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When we subject E to the special transformation 
=e(t#n), = (A ¥ 0), 


each term of C (¥;;,) is equal to the corresponding term of C (;;,) multiplied 
by A“""*, where w, is the weight of that term in e,. This follows at once 
from the remarks at the end of section 2, since this is the transformation (\),. 
But under this transformation, C (7;;,) is multiplied by \“/" and thus 
Wn — = (d —8)/n or w, =([d+(n—1)s8]/n. It follows that the index 
of the power of A—namely (d — s)/n—is an integer. Moreover, it is not 
negative, for it is the weight of the coefficient of a term of C which is indepen- 
dent of z,. Now such a term actually occurs in C, as otherwise 2, would be 
a factor of C , and hence (in view of the covariancy of C) every linear function 
of the x’s would be a factor of C’, which is impossible. Thus we have proved 

TuHeoreM II. For an n-ary linear algebra E = (e,, «++, €n) where the con- 
stants of multiplication are i; and the general number of the algebra is X => x: e;, 
let C be a rational integral covariant of degree d in the yij; and degree s in the x;. 
Then C is isobaric of total weight d + (n — 1)8 in all the units. If C is homo- 
geneous in the x’s, it is homogeneous in the y’s. Moreover, if we subject the 
units of E to the transformation T of determinant A # 0, carrying E into the 
algebra E’ with constants of multiplication y;;x, then 


C = AS" C 21). 


The index, (d — s)/n, is an integer, positive or zero.* 

4. Fundamental sets of invariants and covariants for n = 1,2. When 
n = 1, the multiplication table is e; e, = ye; and the general number of the 
algebra is 2, e,. Hence 6(w) = 6’(w) = w — ya,; and the only rational 
integral invariants in this case are powers of y, and the only rational integral 
scalar covariants of the unary algebra are all polynomials in y and 2. 

When n = 2, the multiplication table is of the form e; e; = Yij1 €1 + Yij2 2 
(7, j = 1, 2) and the general number of the algebra is X = 2; e; + 22 @. 


Hence 
5(w) w —w[T, 21+ + [Tsai +75 22], 


5’ (w) w —w[T3a, +1727 + + T3232], 
where 
Ty = Yiu + Yi22; You + Y222, 


= Yin + = + Y222; 


* By the use of this theorem, we can readily prove the following corollary: Every rational 
integral invariant and covariant of the general linear algebra vanishes for a nilpotent algebra. 
This theorem was stated and proved for invariants in a paper in the Annals of Mathe- 
matics, second series, vol. 18 (1916), p. 84, and the proof for covariants proceeds in essen- 
tially the same way. 
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= ¥111 Y122 — Yii2 Yi215 = You Y222 — Y212 Y2215 
= Y212 — Y2115 I's = Y222 — Y122 Y2215 
= Yo22 + Yo11 Yi22 — Yi21 Y212 — Y2215 
= Y222 + Yi21 Yo12 — Yi22 — ‘Y112 Y221- 
We may prove without any difficulty that, for a binary algebra, every 
homogeneous isobaric function of the constants of multiplication for which 
Ww, = wz and which is annihilated by 


0 0 
U 9 = — 
2 = + + (Y222 1121) + — Ye) 


+ [(-ya12 + Yi22) + | 


is an invariant of the algebra. We can prove a similar theorem for scalar 
covariants. Accordingly, one may readily determine the rational integral 
scalar covariants of a given weight, and so one might hope to find a fundamental 
set of covariants for the binary algebra. 

One can, however, avoid such troubles in view of 

TueoreM III. Every scalar covariant of the binary algebra E = (e, €2) 
where the constants of multiplication are ij, and where the general number of 
the algebra is X = x, €; + 22 €2 is a covariant of the characteristic determinants 
5 and 6’. 

We want to show that any rational integral function C (yi;x ; 21) of the 
eight y’s and the 2’s which has the invariantive property under the group of 
linear transformations on e, and é2 is a function of T,, ---, I's and the 2’s 
which is a covariant of the system of forms T, 2; + Tea, [T32, + 1422, 
Tsai + + 23,07 27 + 2) + I's under the group of linear 
transformations on x; and x2. Since x; and 22 are contragredient to e; and 
e2, C (vise ; X1) has the invariantive property under the group of linear trans- 
formations on 2, and 2x2 where the y’s are transformed according to (2). Thus 
we can prove our theorem if we can show that any function of the y’s is a 
function of T,, ---, I's. Now this follows from the fact that the Jacobian 
of T,, ---, Ts is 


_lo 
| Y122 
10 


J 
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0 0 0 0 0 0 1 
1 0 0 0 0 1 0 
0 0 0 0 1 0 0 
1 0 0 1 0 0 0 
0 — 0 0 0 Yiu 
Y211 0 Vass 0 Y222 0 
0 — 0 0 Yur — Yu 0 
Y121 0 — Vise Y222 0 0 — You | 
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which is not identically zero, since the terms free of yii2, Y111, Y222 and Yoo 
equal the four-rowed determinant in the lower left-hand corner. 

5. Finiteness of the rational, integral scalar covariants. In view of the 
homogeneity and isobarism of rational integral scalar covariants, we can 
readily prove that they are all expressible as polynomials in a finite subset. 
For, by Hilbert’s theorem about an infinite sequence of polynomials,* any 
such covariant C can be expressed in the form 


(5) C(y; = 2) 2) 


where the C; are scalar covariants of the algebra which are the same for every 
C. Since C and C; are homogeneous and isobaric, we may suppose that the 
K; are also. If we subject FE to the transformation 7 of determinant A + 0, 
(5) is replaced by 


(6) Av C(y; = Kj (7; 2’) x), 


where C is of index w and C; is of index w;. Now operate on both sides of 
(6) with 


= 


until both sides of the resulting equation no longer involve the a’s. Then, 
by a well-known theorem of Hilbert’s (which is essentially one given by Mertens 
and Gordanf), (6) becomes 


NC(y; = DN; Ci (7; x) 2), 


where the C; are scalar covariants of the algebra, and N and N; are constants 
different from zero. Thus we prove by induction 

THEOREM IV. For the general n-ary linear algebra there is a finite set Ci, 

-, Cn of scalar covariants such that every rational integral scalar covariant of 
the algebra is a polynomial in C,, «++, Cn. 

Corotiary. For the general n-ary linear algebra there is a finite set 
B,, --:, B, of rational scalar covariants such that every rational integral 
scalar covariant of the algebra is a covariant of this set of algebraic forms, 
For convenience, such a set of covariants will be called basic. 


*“Ueber die Theorie der algebraischen Formen,’”’ Mathematische Annalen, 
vol. 36 (1890), p. 474. 

+ Hilbert, loc. cit., p. 524; Gordan, Beweis, dass jede Covariante und Invariante einer 
bindren Form eine ganze Function mit numerischen Coefficienten einer endlichen Anzahl solcher 
Formen ist, Journal fiir Mathematik, vol. 69 (1868), pp. 323-354; Die simultanen 
Systeme bindrer Formen, Mathematische Annalen, vol. 2 (1870), pp. 227-280; 
Invariantentheorie (1887), vol. II, §9; Mertens, Beweis dass alle Invarianten und Covarianten 
..+Journal fir Mathematik, vol. 100 (1887), pp. 223-230. 
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Part II. Vector CovaRIANTS 


6. Preliminary remarks. As we pointed out in the introduction, scalar 
covariants are not sufficient to characterize linear algebras—in fact, they are 
not sufficient to characterize potent algebras in 2 units, for the two algebras 


aa =, = 0, = (¥ > 0); 
=e, = ey 0 


are not equivalent with respect to the field of reals, and yet for both algebras 
5 = 6’ = w(w — 2,) and both have the same rank and the same index. 

Accordingly we shall now consider vector covariants. By a rational integral 
function of the y’s, the 2’s and the e’s we shall understand any linear com- 
bination of a finite number of products of units of the algebra where the 
coefficients are polynomials in the y’s and 2’s. It must be borne in mind 
that, since multiplication is not in general commutative nor associative, 
we cannot, for instance, combine into one term two such as é¢; [ (é€; é2 ) e3] and 
(e; €:) (€2e3). That is, in some terms the multiplication is to be carried 
out in one order and in other terms in a different order. Every rational 
integral vector covariant V is, however, the sum of a finite number of vector 
covariants V; such that, in any one V;, the multiplication indicated in every 
term is to be carried out in the same order. 

7. Fundamental properties. We can readily show that if a vector covariant 
is homogeneous in the units, e;, and homogeneous in the 2;, it is homogeneous 
in the yijz. Moreover it is isobaric in any unit e;, and its weight in e; is 
equal to its weight in e; (7,7 = 1,---,m). From these properties we can 
readily prove the analogue of Theorem II, which we state here merely for the 
sake of completeness. 

THeoreM V. For an n-ary linear algebra E = (€,, +--+, €n) where the 
constants of multiplication are ij, and the general number of the algebra is 
X = Dax;e;, let V be a rational integral vector covariant of degree d in the 
vYijx, Of degree s in the x’s and of degree v in the e’s. Then V 1s isobaric of 
weight w = d+(n—1)s8s +0 in the units altogether. If V is homogeneous 
in the x’s and the e’s, it is homogeneous in the y’s. Furthermore, if we subject 
the units of E to the transformation T of determinant A + 0, carrying E into 
the algebra E’ = (e;, «++, €,) where the general number of the algebra is 
X’ = D 2x; e; and the constants of multiplication are y;;%, then 


V (Vises 213 Cn) = AS V 213 Om). 


Moreover the index (d — s + v)/n is an integer, positive or zero. 
8. Annihilators for vector covariants. Since we are dealing with a linear 
algebra, every vector covariant whose degree in the units is greater than 2 
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can be made linear in the units. Henceforth, unless otherwise specified, we 
shall think of every vector covariant as linear in the units. 

First consider absolute vector covariants of degree 1 in the x’s. Now such 
a covariant is of the form 


n 


(7) das (7) + bei + + (7) 1 

where each ¢;; is a polynomial of weight zero in each unit, and each ¢;; (7 + 7) 
is a polynomial of weight 1 in e;, of weight — 1 in e;, and of weight zero in the 
other units. But any polynomial in the y’s of degree d is of total weight d 
in the e’s altogether. Hence each ¢;; = 0 (i + j) and each ¢,;; is a constant, 
c;. But, since (7) is unaltered if we interchange any two units, the ¢; are all 
equal, and thus a vector covariant which is of the first degree in the 2’s is 
necessarily a constant multiple of the general number of the algebra. 

Next, consider absolute vector covariants of degree 2 in the 2’s ; such a 
covariant is necessarily of the form 


n 


i 


where each ¢;;; is of weight 1 in e; and of weight zero in e; (1 +7), each 
oii (4 + 7) is of weight 1 in e; and of weight zero in e; (1 +7), and each 
ojxi (J, & + 7) is of weight — 1 in e;, of weight 1 in e; and in e; and of weight 
zero in e; (1 +7, j,k). Accordingly each ¢ is of degree 1 in the y’s. In 
particular, 


dir = + B(Y122 + V133 + Yinn) + + + nin), 
= + € (323 + + + + + + Venn) + OY 222, 
231 = KY231 + AYs21, = MY221; 


and any ¢,;; is obtained from ¢,;; by interchanging the subscripts 1 and 7; 
any $;;: (j + 7) is obtained from ¢ 2; by interchanging 1 and 7, and 2 and 7; 
and so forth. Hence the difference between (8) and BC,,; X + yCiiX 
+ (a — 8B — y) X* is a covariant of the type (8) where the a, 8, y are zero. 
Here C,_; and C)_, are the coefficients of w"~! in 6(w) and 6’ (w) respectively. 
Hence consider the covariant (8) where a, 8, y are all zero. Since such a 
covariant must be unaltered under the transformation 


=ate, (i+1), 


we must have 6 (©=h=p=0. Therefore every ra- 
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tional integral absolute vector covariant of degree 2 in the 2’s is a homogeneous 
polynomial of degree 2 in C,,, and X = 2; e;. 

More generally we have 

TuHeoreM VI. Every rational integral vector covariant of the n-ary linear 
algebra is a covariant of the general number of the algebra and any basic set of 
scalar covariants of the algebra. 

We prove this by using a set of annihilators for vector covariants—differ- 
ential operators analogous to the familiar Aronhold operators 2 and O. In 
deriving such operators for the scalar covariants, we used Taylor’s theorem 
with a remainder for polynomials in the y’s and the z’s, and so we now seem 
to be confronted with the necessity of inquiring if this theorem (or one closely 
analogous) holds for polynomials in the y’s, the 2’s and the e’s—which we 
might, for brevity, call vector polynomials. This brings forward the question 
of a differential calculus for such vector polynomials. 

Now Scheffers* has considered the theory of functions of hypercomplex 
variables. In a linear algebra over the field C of ordinary complex numbers 
in which division is generally possible, he considers functions of the form 
f = X fie: where each f; is an analytic function of the z; (j = 1, ---, n). 
When 

dx = dz; e;, 


Of; 
df = aes 


Then Scheffers shows that such a linear algebra which possesses a principal 
unit permits of functions f with derivatives with respect to x which are inde- 
pendent of Az,, ---, Ax,—that is, independent of the “ direction ” in which 
Az approaches zero—if and only if multiplication is commutative. 

It would seem, then, as if we could not hope to find annihilators for vector 
covariants of the general n-ary linear algebra. There are, however, several 
ways of avoiding this little difficulty. 

As we pointed out at the beginning of this sect’on, we can think of a vector 
covariant as linear in the e’s ; and so, to derive the desired annihilators, we 
need only to consider such derivatives as 


0e; 0e2 


Oe; Oe; 


With this end in view, let us consider the linear function @(e,) = e¢,. Re- 


* Verallgemeinerung der Grundlagen der gewéhnlichen compleren Functionen, Leipziger 
Berichte, vol. 45 (1893), pp. 828-848; vol. 46 (1894), pp. 120-134. There are abstracts 
of these articles in Paris Comptes Rendus, vol. 116 (1893), pp. 1114-1117, 1242-1244, 
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placing e; by e; + Ae, where the increment Ae, is a scalar multiple of e2, say 
mez, we have Ad = Ae,;. Now divide on the right by Ae, i. e. find a number 
u such that u-Ae, = Ae,. If we are dealing with any particular algebra, 
this number u depends on that algebra; but we are interested in the general 
algebra, and thus we want a number wu such that w-Ae, = Ae; (or u-e2 = e2) 
for every n-ary linear algebra. One such number uw is the scalar 1. If we 
take u = 1,.de,/de, = 1. With a similar understanding, 


dex 


des 


Oe; ~ 0, 


Similarly for the derivatives with respect to the other units. 

The advantage of defining these derivatives in this way is that we can 
differentiate with respect to any unit, such as e,, in a purely formal manner— 
precisely as if we were differentiating with respect to a scalar. Thus a 
vector covariant which is linear in the units is annihilated by the differential 
operators 


04;; Ox}, 04;; 
where the U;; are the annihilators for the invariants of the n-ary algebra. 

Or we may reason as follows. 

Every vector covariant when linear in the units has these annihilators, 
since such a covariant has the invariantive property when the e’s are replaced 
by a set of cogredient scalars. 

Theorem VI now follows at once from the fact that the annihilators for the 
vector covariants of the algebra are the same as those for covariants of a basic 
set of scalar covariants of the algebra and the general number of the algebra, 

9. Finiteness of vector covariants. We now readily prove 

THEOREM VII. For the general n-ary linear algebra there 1s a finite set V1, 

-, Vm of rational integral vector covariants such that every vector covariant of 
the algebra is a linear function of Vi, «++, Vm. 

For every vector covariant of a linear algebra may be made linear in the e’s. 
Now thinking of the e’s as replaced by a set of cogredient scalars, which we 
shall call the y’s, we have a set of rational integral functions of the variables 
2; (1 =1, ---,m) and the contragredient variables y; = 1, ---,n) to- 
gether with the n* parameters yi; (2i;, k = 1, ---, n) which are homogeneous 
and isobaric and which have the invariantive property under the total linear 


B->. 
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group on the n variables 2;. Accordingly Hilbert’s “ finiteness 
applies. Or this theorem can be proved as a corollary of Theorem VI. 
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